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METHOD FOR SELECTING RECOMBINASE VARIANTS 
WITH ALTERED SPECIFICITY 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims benefit of U.S. Provisional Application No. 
60/127,977, filed April 6, 1999. Application Serial No. 60/127,977, filed April 
6, 1999, is hereby incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
Recombinases, integrases and resolvases (collectively referred to herein 
as recombinases) mediate the site-specific recombination of DNA. These 
recombinases were first identified in phage that integrate into host 
chromosomes. Such integration allows the phage to remain latent in the cell as a 
prophage. 

Site-specific recombinases catalyze conservative DNA rearrangements at 
specific target sequences. The 38 kDa Cre recombinase (cyclization 
recombination), derived from the bacteriophage PI, is a well characterized and 
widely used enzyme of the Integrase family (reviewed by Sauer, Methods, 
14:381-392 (1998)). Cre plays two essential roles in the life cycle of PI : First, 
it provides a host-independent mechanism for PTs genome cyclization after 
infection, which can be important when the recombination system of the host is 
compromised. Second, Cre resolves dimerized PI prophage plasmids to 
guarantee proper segregation during cell division. 

Cre acts on a 34 bp sequence located on both ends of the linear PI 
genome, that is called lox? (locus of crossover of PI; Sternberg and Hamilton, J. 
Mol Biol, 150:467-486 (1981)), lox? consists of two 13 bp inverted repeats 
flanking a non-palindromic 8 bp core that defines the assigned direction of the 
sequence (as shown on the upper part of Figure 1). Depending on this direction, 
recombination catalyzed by Cre leads to excision of insertion of DNA flanked 
by loxV sites orientated in the same direction (indicated by /oxP^), but leads to 
inversion when oriented in the opposite direction (Figure 2). 
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In general, Cre-recombination involves the following four events: (/) 
DNA binding, (ii) synapsis (as defined below), (///) cleavage, and (iv) strand 
exchange. To study this process in greater detail, mutants defective for each 
step have been isolated using several screening procedures (Wierzbicki et al., J. 
5 MoL Biol, 195:785-794 (1987)). In addition, the crystal structure of Cre 
complexed with an artificial suicide substrate has been recently resolved, 
providing additional insights into site-specific recombination (Guo et al., 
Nature, 389:40-46 (1997)). From these studies, the following has been 
proposed: Four interacting Gre molecules are necessary for recombination 

10 between two lox sites, with each enzyme binding one inverted repeat plus the 
two outermost bp of the non-symmetric core region (DNA binding). This leads 
to the formation of a clamp, allowing DNA contacts in the major, as well as in 
the minor groove. In the step referred to as synapsis, the two lox sites with the 
bound Cre molecules, are aligned in parallel leading to an approximate lOO'' 

15 bending of the DNA. In the following step of strand cleavage, one of the two 
Cre molecules on each lox site causes a staggered cut in the core region, as 
indicated by the vertical arrows in Figure L This leads to a 6 bp 5' overhang 
and a covalent 3' phosphotyrosine linkage between the catalytic residue tyrosine 
324 of Cre and the guanine (position 4) at the cleaving site of loxV. The created 

20 phosphotyrosine intermediate is thought to provide the energy for the reaction, 
thereby explaining why Cre does not require an external energy source. In the 
next step, the first strand is exchanged between the two nicked lox sites, creating 
an intermediate, named Holliday structure (Sigal and Alberts, 1 Mol Biol, 
71 :789-793 (1972)). Of note, this first strand exchange is asymmetric, since the 

25 bottom strand (Figure 1) is always exchanged first (Hoess et al, Proc, Natl 

Acad Sci USA, 84:6840^6844 (1987)). During the final step, the second strand 
is exchanged and Cre released from its substrate. 

Because of the simplicity and the ability of Cre to fimction in yeast and 
mammalian cells (Sauer, B., Mol Cell Biol, 7:2087-2096 (1987); Sauer and 

30 Henderson, Proc. Natl Acad. Sci USA, 85:5166-5170 (1988), Sauer and 
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Henderson, Nucl Acids Res,, 17:147-161 (1989), and Sauer and Henderson, The 
New Biologist, 2:441-449 (1990), Cre assisted site-specific recombination has 
become an important tool for efficient, specific, and conditional manipulations 
of eukaryotic genomes (Lakso et al., Proc. Natl. Acad Sci. USA, 89:6232-6236 
(1992)): Kilby et aL, Genet., 9:413-421 (1993); Sauer, B., Metk enzymol, 
225:890-900 (1993); Ktihn et al. Science, 269:1427-1429 (1995); Metzger et aL, 
Pro. Natl Acad Sci. USA, 92:6991-6995 (1995). 

However, there are some inconveniences for the successful use of Cre- 
related technologies, that include the following: (f) lox sites need to be 
introduced by homologous recombination at the desired region into the genome 
before Cre can be used, (zz) the frequency of correct site-specific recombination 
due to Cre expression is not 100%, and consequently, {Hi) selectable markers are 
necessary in most strategies involving Cre for genome manipulation in higher 
eukaryotes. These markers, e.g. neo or TK, may introduce problems in 
subsequent studies, particular in those related to animal development. The 
number of available selectable markers that can be used in limited also. 
Additional site-specific recombinases that also function efficiently in eukaryotic 
systems, but recognize different sites fi-om lox would be helpful. Similar 
inconveniences limit the usefulness of other recombinases. 

Therefore, it is an object of the present invention to provide a method of 
identifying variant recombinases that can mediate recombination between 
variant recombination sites. 

It is another object of the present invention to provide variant 
recombinases that can mediate recombination between variant recombination 
sites. 

It is another object of the present invention to provide a method of 
recombining nucleic acid molecules in vitro and in vivo. 

It is another object of the present invention to-provide Cre variants that 
recognize variant recombination sites. 
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BRIEF SUMMARY OF THE INVENTION 
Disclosed is a method for identifying variant forms of recombinases that 
can mediate recombination between variant recombination sites. The method 
involves producing mutant recombinases and testing the mutant recombinases 
with specially designed constructs. The constructs contain variant 
recombination sites that are not recognized by non-mutant recombinase but will 
undergo recombination in the presence of a mutant recombinase with altered 
specificity. Recombination at the variant recombination sites can be monitored 
or detected by any suitable means. It is preferred that recombination is detected 
by screening or selection based on the expression or lack of expression of a 
reporter gene. This can be accomplished by using constructs containing a 
reporter gene associated with the variant recombination sites such that the 
reporter gene is rearranged or deleted, or a spacer sequence interrupting the 
reporter gene is rearranged or deleted, as a result of recombination at the 
recombination sites. Recombination of such constructs will result in a loss of 
expression of the reporter gene, where the construct contained a functional 
reporter gene, or in a gain in expression of the reporter gene, where the construct 
contained a non-functional reporter gene. 

The disclosed method also involves determining whether a variant 
recombinase retains the ability to mediate recombination at recombination sites 
recognized by non- variant recombinase. This can be accomplished by using 
constructs containing recombination sites recognized-by non- variant 
recombinase. Recombination at these recombination sites can be monitored or 
detected by any suitable means. It is preferred that recombination is detected by 
screening or selection based on the expression or lack of expression of a reporter 
gene. This can be accomplished by using constructs containing a reporter gene 
associated with the recombination sites recognized by non- variant recombinase 
such that the reporter gene is rearranged or deleted, or a spacer sequence 
interrupting the reporter gene is rearranged or deleted, as a result of 
recombination at the recombination sites. Recombination of such constructs 
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will result in a loss of expression of the reporter gene, where the construct 
contained a functional reporter gene, or in a gain in expression of the reporter 
gene, where the construct contained a non-functional jeporter gene. 

When variant recombinases are tested for activity on both variant 
5 recombination sites and recombination sites recognized by non-variant 
recombinase in the same system or at the same time, it is preferred that two 
different reporter genes, which can be separately detected or monitored, be used. 
In this case, the first reporter gene can be associated with the variant 
recombination sites and the second reporter gene can be associated with 

1 0 recombination sites recognized by non- variant recombinase. 

Recombination between two recombination sites requires (1) that the 
recombinase recognize the sites as recombination sites, and (2) that the 
sequences of the two sites is sufficiently similar. It has been discovered that 
recombination between two recombination sites (both of which are recognized 

15 by a recombinase) can be substantially reduced or prevented by using different 
compatibility sequences for the recombination sites (the recognition sequences 
can also differ if the recombinase can recognize different sequences). Thus, it is 
also preferred that the variant recombination sites be made incompatible with 
the recombination sites recognized by non-variant recombinase by using 

20 different compatibility sequences for the two sets of recombination sites. 

Compatibility sequences in a recombination site are those sequences in 
the recombination site, other than the sequences required for recognition of the 
site by the recombinase, that must be similar in a pair of recombination sites for 
recombination to occur between them. Many recombination sites contain 

25 repeats of a characteristic sequence separated by spacer sequences. In such 

recombination sites, the spacer sequences are generally compatibility sequences 
and the repeats (or parts of the repeats) are recognition sequences. 
Recombinases require specific recognition sequences but allow wide variation in 
compatibility sequences. Thus, recombination sites that are recognized by a 
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given recombinase but are incompatible with each other can be readily designed 
using the disclosed principles. 

Also disclosed are variant recombinases made or identified by the 
disclosed method that have broadened specificity for the site of recombination. 
5 Specifically, the disclosed variants mediate recombination between sequences 
other than recombination sites on which the wild type recombinase is active. In 
general, the disclosed recombinase variants can mediate efficient recombination 
between recombination sites that wild type recombinase can act on (referred to 
as wild type recombination sites), between variant recombination sites not 

1 0 efficiently utilized by wild type recombinase (referred to as variant 

recombination sites), and between a wild type recombination site and a variant 
recombination site. 

Also disclosed are methods of recombining nucleic acids using the 
disclosed variant recombinases. For example, the disclosed variant 

1 5 recombinases can be used in any method or technique where wild type 

recombinases can be used. In addition, the disclosed variant recombinases allow 
different alternative recombinations to be performed since the variant 
recombinases can allow much more efficient recombination between wild type 
recombination sites and variant recombination sites. Control of such alternative 

20 recombination can be used to accomplish more sophisticated sequential 

recombinations to achieve results not possible with wild type recombinases. 
The disclosed variant recombinases also allow recombination at specific 
genomic sites without the need to first introduce a recombination site. 

Also disclosed are variants of Cre recombinase that have broadened 

25 specificity for the site of recombination. Specifically, the disclosed variants 
mediate recombination between sequences other than the loxP sequence and 
other lox site sequences on which wild type Cre recombinase is active. In 
general, the disclosed Cre variants mediate efficient recombination between lox 
sites that wild type Cre can act on (referred to as wild type lox sites), between 

30 variant lox sites not efficiently utilized by wild type Cre (referred to as variant 
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lox sites), and between a wild type lox site and a variant lox site. Also disclosed 
are methods of recombining nucleic acids using the disclosed Cre variants. For 
example, the disclosed Cre variants can be used in any method or technique 
where Cre recombinase (or other, similar recombinases such as FLP) can be 
5 used. In addition, the disclosed Cre variants allow different alternative 
recombinations to be performed since the Cre variants allow much more 
efficient recombination between wild type lox sites and variant lox sites. 
Control of such alternative recombination can be used to accompUsh more 
sophisticated sequential recombinations to achieve results not possible with wild 
1 0 type Cre recombinase. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a comparison of three different lox sites. loxP is the original 
recombination site for Cre recombinase. loxKl and loxK2 are variant lox sites. 
Figure 2 is a diagram of two different forms of construct and the 
1 5 resulting recombination products. 

Figure 3 is a diagram of an example of a random mutagenesis using 
DNA shuffling. 

Figure 4 is a diagram of the selection plasmid for loxK2 recombination, 
pBS584. Recombination of two loxK2 sites by a potent Cre mutant will result in 

20 the excision of EGFP and the transcriptional terminator rrnBTx T2. 

Subsequently, neo transcription can take place, rendering E. coli resistant to 
kanamycin. Note that the promoter (pRSV) even though of eukaryotic origin 
was shown to be functional in E. coli (Antonucci et al., J, Biol Chem,, 
264:17656-17659(1989)). 

25 Figure 5 depicts gels of nucleic acid fragments and PCR products 

generated during the DNA shuffling process. 

Figure 6 is a diagram of plasmid pBAD33 used for expression of mutant 
cre pools. 

Figure 7 is a diagram of the construction of selection plasmids pBS568 
30 and pBS569. 
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Figure 8 is a diagram of the construction of selection piasmids pBS583 
and pBS584. 

Figure 9 is a diagram of control plasmid pBS613. 

Figure 10 is a diagram of the construction of screening piasmids pBS601 
5 andpBS602. 

Figure 11 is a diagram of examples of basic types of constructs useful in 
the disclosed method. These types of constructs are: (1) interrupted constructs 
where the gene is interrupted by a nucleic acid segment (which is flanked by 
recombination sites) that is deleted during recombination, (2) flanked constructs 
10 where the gene as a unit is flanked by recombination sites and the gene is 

deleted by recombination, and (3) inverted constructs where a portion of the 
gene is on an inverted nucleic acid segment and recombination causes the 
segment to invert and reconstitute the intact gene. The type of recombination is 
indicated in parentheses. 
15 Figures 12 A, 12B, and 12C are diagrams of examples of constructs and 

their expected recombination when used in the disclosed method. Figure 12A 
shows examples of deletion constructs (flanked and interrupted). Figure 12B 
shows examples of inverted constructs. Figure 12C shows examples of 
constructs that combine through recombination to reconstitute an intact gene. 
20 Figure 13 is a diagram showing the identified amino-acid changes in the 

six selected Cre mutants are listed according to their position in the protein's 
secondary structure (silent mutations in parenthesis). Only one amino acid 
change, E262G, is common to all mutants with remarkably increased /oxK2 
activity (R3 Ml, 2, 3, 5, and 6), suggesting that this mutation is essential for the 
25 observed phenotype. 

Figure 14A is a table comparing recombination frequencies in vivo 
obtained with a variety of lox sites altered at positions 1 1 and 12. Figure 14B is 
a table comparing recombination frequencies in vivo obtained with identical and 
mixed lox sites. Wild type Cre and five different mutant enzymes were tested 
30 for their performance on different lox^ substrates, as indicated. Given are the 
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obtained percentages of recombination in vivo based on the described negative 
selection. 

Figure 15 is a table comparing recombination frequencies in vitro 
obtained with a variety of /ox sites altered at positions 1 1 and 12. 
5 Figure 16 is a graph of percent of various Cre recombinases (wt, G, GA, 

GN, GS, R3M3) bound to various lox sites (loxP, loxK2, loxKl). 

Figure 17 shows wildtype and target FRT sites. 

Figure 18 shows the strategy for selection of altered specificity FLP 
mutants. 

10 Figure 19 shows an alternate target mutant FRT site. The design and 

rationale for design of the target mutant FRT site is as described in Figure 17, 
but the mutant FRT-M2 site differs from FRT-M by carrying a different 
mutational alteration in the repeat elements. 

DETAILED DESCRIPTION OF THE INVENTION 

1 5 Disclosed is a method for identifying variant forms of recombinases that 

can mediate recombination between variant recombination sites. The method 
involves producing mutant recombinases and testing the mutant recombinases 
with specially designed constructs. The constructs contain variant 
recombination sites that are not recognized by non-mutant recombinase but will 

20 undergo recombination in the presence of a mutant recombinase with altered 
specificity. The disclosed method also involves determining whether a variant 
recombinase retains the ability to mediate recombination at recombination sites 
recognized by non- variant recombinase. 

When variant recombinases are tested for activity on both variant 

25 recombination sites and recombination sites recognized by non- variant 

recombinase in the same system or at the same time, it is preferred that two 
different reporter genes which can be separately detected or monitored be used. 
In this case, a first reporter gene can be associated with the variant 
recombination sites and a second reporter gene can be associated with 

30 recombination sites recognized by non- variant recombinase. It is also preferred 
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that the variant recombination sites be made incompatible with the 
recombination sites recognized by non-variant recombinase by using different 
compatibility sequences for the two sets of recombination sites. This allows 
separate assessment of the ability of a variant recombinase to mediate 
recombination between variant recombination sites and recombination sites 
recognized by non- variant recombinase. 

Also disclosed are variant recombinases made or identified by the 
disclosed method that have broadened specificity for the site of recombination. 
Also disclosed are methods of recombining nucleic acids using the disclosed 
variant recombinases. For example, the disclosed variant recombinases can be 
used in any method or technique where wild type recombinases can be used. In 
addition, the disclosed variant recombinases allow different alternative 
recombinations to be performed since the variant recombinases can allow much 
more efficient recombination between wild type recombination sites and variant 
recombination sites. Control of such alternative recombination can be used to 
accomplish more sophisticated sequential recombinations to achieve results not 
possible with wild type recombinases. 

Also disclosed are variants of Cre recombinase that have broadened 
specificity for the site of recombination. Specifically, the disclosed variants 
mediate recombination between sequences other than the loxP sequence and 
other lox site sequences on which wild type Cre recombinase is active. 
Preferred forms of the disclosed Cre variants have the amino acid sequence SEQ 
ID N0:1 (top sequence. Table 11) altered by one or more amino acid 
substitutions, deletions, or insertions, where the glutamic acid at amino acid 262 
has been substituted with an amino acid other than glutamic acid, and where the 
Cre variant recognizes (that is, mediates recombination at) a variant lox 
recombination site. Useful Cre variants include proteins that recognize a variant 
lox recombination site and have the amino acid sequence SEQ ID N0:1 altered 
by substitution of the glutamic acid at amino acid 262 with an amino acid other 
than glutamic acid and one or more of the following amino acid substitutions: 
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isoleucine at amino acid 16, alanine at amino acid 29, glutamine at amino acid 
101, glycine at amino acid 138, asparagine at amino acid 189, serine at amino 
acid 198, glutamine at amino acid 220, glutamine at amino acid 223, isoleucine 
at amino acid 227, glycine at amino acid 254, arginine at amino acid 255, 
5 glutamine at amino acid 284, leucine at amino acid 307, and serine at amino acid 
316. Preferred amino acid substitutions at amino acid position 262 include 
alanine, tryptophan, or glycine. 

Examples of preferred Cre variants include proteins having the amino 
acid sequence SEQ ID NO: 1 altered by substitutions E262G and Dl 89N; 

10 proteins having the amino acid sequence SEQ ID N0:1 altered by substitutions 
E262G and T316S; proteins having the amino acid sequence SEQ ID N0:1 
altered by substitutions E262G and D29A; proteins having the amino acid 
sequence SEQ ID NO:l altered by substitutions E262G, V16I, D189N, G198S, 
R223Q, Q255R, and P307L; proteins having the amino acid sequence SEQ ID 

15 N0:1 altered by substitution E262G; proteins having the amino acid sequence 
SEQ ID N0:1 altered by substitution E262A; and proteins having the amino 
acid sequence SEQ ID NO:! altered by substitution E262W. The substitutions 
above are listed using the convention where the first letter is the original amino 
acid (in single letter amino acid code), the number is the amino acid position in 

20 the protein (in this case, using the positions of wild type Cre (SEQ ID NO: 1 )), 
and the last letter is the new amino acid (in single letter amino acid code). All of 
these Cre variants recognize both wild type lox sites and variant lox sites with an 
inverted repeat sequence NNNACNNCGTATA (SEQ ID N0:2). 

The disclosed Cre variants recognize variant lox recombination sites. 

25 Preferred variant lox sites are variant lox sites recognized by the Cre variant but 
not recognized by wild type Cre. Examples of useful variant lox sites include 
sites having two 13 base pair inverted repeats flanking 8 base pairs, where one 
of the inverted repeats has the sequence NNNACNNCGTATA (SEQ ID N0:2); 
sites having the sequence N1N2N3ACN4N5CGTATANNNNNNNNTATA 

30 CGN5'N4'GTN3'N2'Ni' (SEQ ID N0:3), where Nf, N2', N3', N4', and N5' are 
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complementary to Ni, N2, N3, N4, and N5, respectively; sites having the 
sequence NiN2N3ACN4N5CGTATANNNNNNNNTATACGN5'N4'GN3'N2'Nr 
(SEQ ID N0:3), where N4N5 are AA, TC, GT, TG, GG, or CC; and sites having 
the sequence GATACAACGTATATACCTTTCTATACGTTGTAT (SEQ ID 
5 N0:4). 

Also disclosed is a method for producing site-specific recombination of 
DNA in cells using the disclosed Cre variants. DNA sequences comprising first 
and second lox sites are introduced into cells and contacted with a Cre variant, 
thereby producing recombination at the lox sites. As with wild type Cre, the 
10 location and orientation of the lox sites determines the nature of the 
recombination. 

As used herein, the expression "site-specific recombination" refers to 
three different types of recombination events: 

1 . deletion of a pre-selected DNA segment flanked by recombination 

15 sites, 

2. inversion of the nucleotide sequence of a pre-selected DNA segment 
flanked by recombination sites, and 

3. reciprocal exchange of DNA segments proximate to recombination 
sites located on different DNA molecules, 

20 It is to be understood that this reciprocal exchange of DNA segments can 

result in an integration event if one or both of the DNA molecules are circular. 
"Nucleic acid segment" refers to a linear segment of single- or double-stranded 
nucleic acid, which can be derived fi-om any source. The segment may be a 
fragment consisting of the segment or a segment within a larger nucleic acid 

25 fragment or molecule. The expression "nucleic acid in eukaryotic cells" 

includes all nucleic acid present in eukaryotic cells. The expression "nucleic 
acid in yeast" includes all nucleic acid present in yeast cells. "DNA segment" 
refers to a linear segment of single- or double-stranded deoxyribonucleic acid 
(DNA), which can be derived from any source. The expression "DNA in 

30 eukaryotic cells" includes all DNA present in eukaryotic cells. The expression 
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"DNA in yeast" includes all DNA present in yeast cells. As used herein, a 
"gene" is intended to mean a DNA segment which is normally regarded as a 
gene by those skilled in the art. The expression "regulatory molecule" refers to a 
polymer of ribonucleic acid (RNA) or a polypeptide which is capable of 
5 enhancing or inhibiting expression of a gene. 

"Regulatory nucleotide sequence," as used herein, refers to a nucleotide 
sequence located proximate to a gene whose transcription is controlled by the 
regulatory nucleotide sequence in conjunction with the gene expression 
apparatus of the cell. Generally, the regulatory nucleotide sequence is located 5' 

10 to the gene. The expression "nucleotide sequence" refers to a polymer of DNA 
or RNA, which can be single- or double-stranded, optionally containing 
synthetic, non-natural, or altered nucleotides capable of incorporation into DNA 
or RNA polymers. As used herein, a "regulatory nucleotide sequence" can 
include a promoter region, as that term is conventionally employed by those 

1 5 skilled in the art. A promoter region can include an association region 

recognized by an RNA polymerase, one or more regions which control the 
effectiveness of transcription initiation in response to physiological conditions, 
and a transcription initiation sequence. "Gene product" refers to a polypeptide 
resulting from transcription, translation, and, optionally, post-translational 

20 processing of a selected DNA segment. 

Materials 

A, Recombinases 

Recombinases suitable for use in the disclosed method include any 
enzyme that mediates recombination at specific sites. This includes enzymes 

25 identified as recombinases as well as other enzymes that function to produce 

recombination such as integrases and resolvases. As used herein, recombination 
at specific sites does not refer only to recombination at completely defined 
sequences. Rather, a recombinase is considered to mediate recombination at 
specific sites when the sites of recombination are limited in some way by 

30 sequence. For example, wild type Cre recombinase mediates recombination 
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between sites having the sequence NiN2N3ACTTCGTATANNNNNNNNT 
ATACGAAGTNs'Na'Ni', which includes both specific and non-specific 
sequences. The sequences ACTTCGTATA and TATACGAAGT (an inverted 
repeat of the first sequence) are recognized by the Cre recombinase. The non- 
5 specific sequences (positions with "N" in the recognition sequence), although 
not limited in sequence, must be compatible with the non-specific sequences of 
the partner recombination site in order for recombination to be efficient. The 
recombination sites need not have any particular number of specific nucleotides. 
All that is required is some constraint on the sequence of the site such that the 

10 recombinase is limited to recombination at some set of sites. 

Examples of recombinases that can be used in the disclosed method 
include Cre recombinase, FLP recombinase, Beta recombinase of pSM 19035 
(Diaz et al, J Biol Chem 274: 6634-6640 (1999)), Int recombinases (Nunes- 
Doby et al., Nucleic Acids Res. 26:391-406 (1998)), and resolvases (Hallet et al, 

15 FEUS Microbiol Rev, 21: 157-178 (1997); Oramet al,, Curr Biol 5: 1106-1109 
(1995); Mondragon, Structure 3: 755-758 (1995)). 
B. Recombination Sites 

Recombination sites are locations within a nucleic acid where 
recombination mediated by a recombinase takes place. Recombination sites 

20 generally include specific sequences, referred to as recognition sequences, 
through which the recombinase recognizes a given nucleotide sequence as a 
recombination site. Different recombinases generally recognize different 
recognition sequences. Recombination between two recombination sites 
requires (1) that the recombinase recognize the sites as recombination sites, and 

25 (2) that the sequences of the two sites are sufficiently similar. It has been 

discovered that recombination between two recombination sites (both of which 
are recognized by a recombinase) can be substantially reduced or prevented by 
using different compatibility sequences for the recombination sites (the 
recognition sequences can also differ if the recombinase can recognize different 

30 sequences). Thus, it is also preferred that the variant recombination sites be 



9973 58vl 



14 



OMRFi78 
204S7/247 



made incompatible with the recombination sites recognized by non- variant 
recombinase by using different compatibility sequences for the two sets of 
recombination sites. Compatibility sequences in a recombination site are those 
sequences in the recombination site, other than the sequences required for 
5 recognition of the site by the recombinase, that must be similar in a pair of 
recombination sites for recombination to occur between them. Generally, 
recombinases require specific recognition sequences but allow wide variation in 
compatibility sequences. Thus, recombination sites that are recognized by a 
given recombinase but are incompatible with each other can be readily designed 

1 0 using the disclosed principles. 

It should be understood that, for a given recombinase site or for a given 
recombinase, whether a given base position in the recombination site is a 
recognition sequence base or a compatibility sequence base may depend on 
other sequences in the recombination site. For example, a particular base may 

1 5 function as a compatibility sequence base in a recombination site having one 
sequence while the same base may function as a recognition sequence base in a 
recombination site having a different sequence. It should also be understood 
that recognition sequences and compatibility sequences do not necessarily occur 
in blocks within a recombination site. That is, recognition sequence base and 

20 compatibility sequence bases may be interspersed in a given recombination site. 
As discussed below, what is and is not a recognition sequence or a compatibility 
sequence in a given recombination site is determined functionally. 

The disclosed variant recombination sites and the variant recombinases 
that can act on them allow more freedom in the selection of sites of 

25 recombination. In particular, the disclosed variant recombinases can allow 
amino acid changes in a protein of interest while retaining the ability to 
recombine at a given site. 

1. Recognition Sequences 

Recognition sequences are regions within a recombination site that must 
30 have a specific sequence, or defined range of sequences, for the cognate 
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recombinase to recognize the recombination site. Recognition sequences in a 
recombination site need not be contiguous. Thus, required nucleotides dispersed 
in a recombination site are collectively considered recognition sequences. 
Nucleic acid segments can be said to have a defined range of sequences when 
5 every nucleotide position in the nucleic acid segment(s) is limited to one, two, or 
three nucleotide bases. That is, so long as a nucleotide position cannot have one 
of the possible nucleotide bases, that position has a defined range of sequence. 
For example, a nucleotide sequence ATRVB YGC has a defined range of 
sequences since each nucleotide position has at least one limitation. Standard 

10 nomenclature for nucleic acid sequences is used herein. Thus, in this example, 
R represents A or G; V represents A, C, or G; B represents C, G, or T; and Y 
represents C or T. 

Recognition sequences for recombinases are known or can be 
determined through routine analysis. In general, recognition sequences can be 

15 determined by varying the sequence of recombination sites and determining if 
recombination between the sites can still occur. For this purpose, the pair of 
sites to be recombined should be identical. That is, the same sequence changes 
should be made to both sites. This eliminates any incompatibility effect 
between the recombination sites. If recombination is eliminated or significantly 

20 reduced when a specific nucleotide is changed, then that nucleotide is required 
for recognition of the recombination site by the recombinase. 

Examples of dissection of the critical sequences in recombination sites of 
recombinases are described by Hoess et al.. Nucleic Acids Res. 14:2287-2300 
(1986) (involving PI recombinase); Sauer B., Nucleic Acids Res., 24:4608-4613 

25 (1996) (involving Ore recombinase); Lee and Saito, Gene 216(l):55-65 (1998) 
(involving Cre recombinase); and Umlauf and Cox, EMBO J 7(6): 1 845-52 
(1988) (involving FLP recombinase). Similar techniques can be used to 
determine the recognition sequences of other recombinases. 
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2. Compatibility Sequences 

Compatibility sequences are regions in a recombination site that must be 
similar in a pair of recombination sites for recombination to occur between 
them. In general, the sequence of recombination sites must be similar for 
5 recombination to occur between them. Examples of compatibility sequences are 
spacer sequences between repeats in recombination sequences. All or some of 
the nucleotides in the recognition sequences for a recombination site may be 
involved in compatibility. For example, where some degeneracy of the 
recognition sequences is allowed, similar recognition sequences may be required 

10 in a pair of recombination sites for recombination to occur between them. Thus, 
compatibility between recombination sites can be affected by using different 
sequences in the compatibility sequences other than the sequences required for 
recognition of the site by the recombinase (that is, recognition sequences), 
compatibility sequences that are part of the recognition sequences, or both. It is 

1 5 preferred that compatibility between recombination sites be altered by using 
different sequences in the compatibility sequences other than the sequences 
required for recognition of the site by the recombinase. 

Compatibility sequences for many recombinases are loiown or can be 
determined through routine analysis. In general, compatibility sequences can be 

20 easily determined by varying the sequence of recombination sites and 

determining if recombination between the sites can still occur. For this purpose, 
only one of the sites in the pair of sites to be recombined should be altered. That 
is, the same sequence changes should not be made to both sites. This isolates 
incompatibility effect between the recombination sites. Further, only those 

25 nucleotide positions that are not a part of the recognition sequence of the site 

should be altered to avoid recognition problems. If recombination is eliminated 
or significantly reduced when a specific nucleotide is changed, then that 
nucleotide is required for compatibility of the recombination site. 

Examples of dissection of the critical sequences in recombination sites of 

30 recombinases are described by Hoess et al.. Nucleic Acids Res. 14:2287-2300 
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(1986) (involving PI recombinase); Sauer B., Nucleic Acids Res., 24:4608-4613 
(1996) (involving Cre recombinase); Lee and Saito, Gene 216(l):55-65 (1998) 
(involving Cre recombinase); and Umlauf and Cox, £MB(9 J 7(6): 1845-52 
(1988) (involving FLP recombinase). Similar techniques can be used to 
5 determine the compatibility sequences of other reconibinases. 

Recognition and compatibility sequences can be further understood using 
Cre recombination sites as an example. Wild type Cre recombinase mediates 
recombination between sites having the sequence N1N2N3ACTTCGTATANN 
NNNNNNTATACGAAGTNs'Na'Nf, which includes both specific and non- 
10 specific sequences (that is, recognition sequences and compatibility sequences, 
respectively). The sequences ACTTCGTATA and TATACGAAGT (an 
inverted repeat of the first sequence) are recognized by the Cre recombinase and 
are the recognition sequences in Cre recombinase sites. Variant Cre 
recombinases recognize sites having different recognition sequences. The non- 
15 specific sequences (positions with "N" in the recognition sequence), although 
not limited in sequence, must be compatible with the non-specific sequences of 
the partner recombination site in order for recombination to be efficient. Thus, 
the non-specific sequences are the compatibility sequences of a recombinase 
site, 

20 C. Recombination Constructs 

Recombination constructs are designed to provide an observable change 
when recombination between recombination sites occurs. Preferred 
recombination constructs include two pairs of recombination sites, one pair 
having a variant sequence and another pair having a sequence recognized by 

25 non-mutant recombinase (for example, wild type recombinase). Sites in the first 
pair are referred to as variant recombination sites. Generally, recombination 
constructs include a first nucleic acid sequence that includes a first reporter gene 
and first and second recombination sites, where the first and second 
recombination sites are variant recombination sites, and a second nucleic acid 

30 sequence that includes a second reporter gene and third and fourth 
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recombination sites, where the third and fourth recombination sites can be 
recombincd by a non-mutant recombinase. The first and second nucleic acid 
sequences need not be present on the same vector or on the same nucleic acid 
molecule (for example, the chromosome), although this is preferred. It is 
5 preferred that recombination constructs be embodied in vectors, such as 
plasmids. 

In one embodiment of the disclosed recombination constructs, the 
sequence of the recombination sites in the constructs are chosen such that the 
recognition sequences of the first and second recombination sites differ from the 

10 recognition sequences of the third and fourth recombination sites. The sequence 
of the recombination sites can also be chosen such that the compatibility 
sequences of the first and second recombination sites^differ from the 
compatibility sequences of the third and fourth recombination sites such that the 
first and second recombination sites cannot recombine with the third and fourth 

15 recombination sites. The sequence of the recombination sites can also be chosen 
such that the compatibility sequences of the first and second recombination sites 
are sufficiently similar to allow recombination between the first and second 
recombination sites, and such that the compatibility sequences of the third and 
fourth recombination sites are sufficiently similar to allow recombination 

20 between the third and fourth recombination sites. The above sequence 

relationships result in constructs where the first and second recombination sites 
can recombine (in the presence of a recombinase that recognizes the sites), the 
third and fourth recombination sites can recombine, but where the neither the 
first nor second recombination site can recombine with either the third or fourth 

25 recombination site (since differences in the compatibility sequences prevent 
recombination). 

Arriving at recombination sites having relationships as described above 
is preferably accomplished in the following way. Starting with a given 
recombination site sequence (which can be recombined by a non-mutant 
30 recombinase), parallel changes are made in the compatibility sequences of the 
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first and second recombination sites. These altered recombination sites should 
then be tested to make sure that the non-mutant recombinase can still mediate 
their recombination. This helps insure that compatibility sequence changes have 
not inadvertently affected the function of the recombination sites. Once this is 
confirmed, changes can be made to the recognition sequences of the first and 
second recombination sites. These changes result in variant recombination sites 
for which variant recombinases can be identified using the method disclosed 
herein. The resulting recombination sites have the desired propertied: 
incompatibility with the third and fourth recombination sites and variant 
recognition sequences that extend the range of recombination-competent sites. 

The recombination sites can have a variety of properties and 
relationships that make them useful for particular purposes. For example, the 
recombination sites can be designed such that the first and second recombination 
sites cannot be recombined by non-mutant recombinase to a significant extent. 
The allows separate assessment of cleavage by mutant and non-mutant 
recombinase. It is also useful if the first and second recombination sites have 
identical sequences, and the third and fourth recombination sites have identical 
sequences. 

Recombination between the recombination sites can have a variety of 
effects that allows detection of recombination. For example, the constructs can 
be designed such that recombination between the first and second recombination 
sites alters the expression of the first reporter gene, where recombination 
between the first and second recombination sites is determined by determining if 
expression of the first reporter gene is altered; recombination between the third 
and fourth recombination sites alters the expression of the second reporter gene, 
where recombination between the third and fourth recombination sites is 
determined by determining if expression of the second reporter gene is altered; 
recombination between the first and second recombination sites allows the first 
reporter gene to be expressed; the first nucleic acid sequence includes a spacer 
sequence flanked by the first and second recombination sites, where the spacer 
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sequence interrupts the first reporter gene such that the first reporter gene is not 
expressed, and where recombination of the first and second recombination sites 
excises the spacer sequence which allows the first reporter gene to be expressed; 
and/or a portion of the first reporter gene is inverted, wherein the inverted 
5 portion of the first reporter gene is flanked by the first and second recombination 
sites, wherein recombination of the first and second recombination sites inverts 
the inverted portion of the first reporter gene which allows the first reporter gene 
to be expressed. 

The constructs can also be designed such that recombination between the 

10 first and second recombination sites prevents expression of the first reporter 
gene; the first reporter gene is flanlced by the first and second recombination 
sites, where recombination of the first and second recombination sites excises 
the first reporter gene which prevents expression of the first reporter gene; a 
portion of the first reporter gene is flanked by the first and second recombination 

1 5 sites, where recombination of the first and second recombination sites inverts the 
flanked portion of the first reporter gene which prevents expression of the first 
reporter gene; recombination between the third and fourth recombination sites 
allows the second reporter gene to be expressed; and/or the second nucleic acid 
sequence includes a spacer sequence flanked by the third and fourth 

20 recombination sites, where the spacer sequence interrupts the second reporter 
gene such that the second reporter gene is not expressed, and where 
recombination of the third and fourth recombination sites excises the spacer 
sequence which allows the second reporter gene to be expressed. 

The constructs can also be designed such that a portion of the second 

25 reporter gene is inverted, where the inverted portion of the second reporter gene 
is flanked by the third and fourth recombination sites? and where recombination 
of the third and fourth recombination sites inverts the inverted portion of the 
second reporter gene which allows the second reporter gene to be expressed; 
recombination between the third and fourth recombination sites prevents 

30 expression of the second reporter gene to be expressed; the second reporter gene 
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is flanked by the third and fourth recombination sites, where recombination of 
the third and fourth recombination sites excises the second reporter gene which 
prevents expression of the second reporter gene; and/or a portion of the second 
reporter gene is flanlced by the third and fourth recombination sites, where 
recombination of the third and fourth recombination sites inverts the flanlced 
portion of the second reporter gene which prevents expression of the second 
reporter gene. 

Expression of a reporter gene can include transcription of the gene, 
translation of the transcript (if the gene encodes a protein), and/or production of 
an active protein. As used herein, whether a reporter gene is expressed depends 
on the context. In general, a gene is considered to be expressed if it produces 
the expression product to be detected. Such expression products include full or 
partial transcripts of the gene, full or partial proteins, including active or inactive 
forms of the proteins, translated from the transcript. Since the goal in using 
reporter genes in the disclosed method is the detection of expression, any of 
these forms of expression product can be the object of detection. For example, 
if the genets transcript is to be detected, the gene will be considered to be 
expressed if it produces the transcript, regardless of whether the transcript is 
translated or whether the resulting protein is active. If the an active protein 
encoded by the gene is to be detected, the gene is not^expressed unless active 
protein is produced-mere transcription of the gene, or even translation to 
produce an inactive protein, will not be enough in this context. As a 
consequence, the expression product to be detected will influence the maimer in 
which reporter genes should be interrupted or invented in the disclosed 
constructs. For example, nearly any interruption of a reporter gene would 
prevent expression of an active protein encoded by the gene. On the other hand, 
an interruption of the coding region will usually not prevent production of a 
transcript. The structure of the disclosed constructs should be designed with 
these principles in mind. As used herein, an inactive expression product refers 
to an expression product that does not have an activity exhibited by the active 
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form of the expression product where that activity is required for detection of 
expression in the assay scheme being used. 

The constructs can be designed such that the first nucleic acid sequence 
is a first nucleic acid construct and the second nucleic acid sequence is on a 
5 second nucleic acid construct; the first nucleic acid construct is an 

extrachromosomal vector and the second nucleic acid construct is in the genome 
of a host cell; and/or the first and second nucleic acid constructs are on the same 
nucleic acid construct, 
D, Reporter Genes 

1 0 Reporter genes are used to monitor whether recombination occurs in the 

disclosed constructs. Reporter genes can be any gene the expression of which 
can be detected either directly or indirectly. These include genes encoding 
enzymes, such as p-galactosidase, luciferase, and alkaline phosphatase, that can 
produce specific detectable products, and genes encoding proteins that can be 

1 5 directly detected. Virtually any protein can be directly detected by using, for 

example, specific antibodies to the protein. A preferred reporter protein that can 
be directly detected is the green fluorescent protein (GFP). GFP, from the 
jellyfish Aequorea victoria, produces fluorescence upon exposure to ultraviolet 
light without the addition of a substrate (Chalfie et al. Science 263:802-5 

20 (1994)). A number of modified GFPs have been created that generate as much 
as 50-fold greater fluorescence than does wild type GFP under standard 
conditions (Cormack et al. Gene 173:33-8 (1996); Zolotukhin et a/., 1 Virol 
70:4646-54 (1996)). This level of fluorescence allows the detection of low 
levels of expression in cells. 

25 Reporter genes encoding proteins producing a fluorescent signal are 

useful since such a signal allows cells to be sorted using FAGS. Another way of 
sorting cells based on expression of the reporter gene involves using the reporter 
protein as a hook to bind cells. For example, a cell surface protein such as a 
receptor protein can be bound by a specific antibody. Cells expressing such a 

30 protein can be captured by, for example, using antibodies bound to a solid 
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substrate, using antibodies bound to magnetic beads, or capturing antibodies 
bound to the reporter protein. Many techniques for the use of antibodies as 
capture agents are known and can be used with the disclosed method. 

The reporter gene can also encode an expression product that regulates 
5 the expression of another gene. This allows detection of expression of the 
reporter gene by detecting expression of the regulated gene. For example, a 
repressor protein can be encoded by the reporter gene. Loss of expression of the 
reporter gene (via recombination) would then result in derepression of the 
regulated gene. This type of indirect detection allows positive detection of loss 
1 0 of the expression of the reporter gene by the affector RNA molecule. One 
preferred form of this type of regulation is the use of an antibiotic resistance 
gene regulated by a repressor protein encoded by the reporter gene. By 
p exposing the host cells to the antibiotic, only those cells in which expression of 

iJJ the reporter gene has been inhibited will grow since expression of the antibiotic 

15 resistance gene will be derepressed. 
Q E. Expression Sequences 

The reporter genes can be expressed using any suitable expression 
- sequences. Numerous expression sequences are known and can be used for 

£ expression of the reporter genes. Expression sequences can generally be 

20 classified as promoters, terminators, and, for use in eukaryotic cells, enhancers. 
G Expression in prokaryotic cells also requires a Shine-Dalgamo sequence just 

upstream of the coding region for proper translation initiation. Inducible 
promoters are preferred for use with the first reporter gene since it is preferred 
that expression of the first reporter gene be adjustable. 
25 Promoters suitable for use with prokaryotic hosts illustratively include 

the p-lactamase and lactose promoter systems, tetracycline (tet) promoter, 
alkaline phosphatase promoter, the tryptophan (trp) promoter system and hybrid 
promoters such as the tac promoter. However, many other functional bacterial 
promoters are suitable. Their nucleotide sequences are generally laiovm. 
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Suitable promoting sequences for use with yeast hosts include the 
promoters for 3 -phosphogly cerate kinase, enolase, glyceraldehyde-3 -phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3 -phosphogly cerate mutase, pyruvate kinase, 
triosphosphate isomerase, phosphoglucose isomerase, and glucokinase. 
Examples of inducible yeast promoters suitable for use in the disclosed vectors 
include the promoter regions for alcohol dehydrogenase 2, isocytochrome C, 
acid phosphatase, degradative enzymes associated mth nitrogen metabolism, 
metallothionein, glyceraldehyde-3 -phosphate dehydrogenase, and enzymes 
responsible for mahose and galactose utilization. Yeast enhancers also are 
advantageously used with yeast promoters. 

Preferred promoters for use in mammalian host cells include promoters 
from polymoma virus. Simian Virus 40 (SV40), adenovirus, retroviruses, 
hepatitis B virus, herpes simplex virus (HSV), Rous sarcoma virus (RS V), 
mouse mammary tumor virus (MMTV), and most preferably cytomegalovirus 
(CMV), or from heterologous mammalian promoters such as the p actin 
promoter. Particularly preferred are the early and late promoters of the SV40 
virus and the immediate early promoter of the human cytomegalovirus, MMTV 
LTR, RSV-LTR, and the HSV thymidine kinase promoter. 

Transcription of the reporter gene by higher eukaryotes can be increased 
by inserting an enhancer sequence into the vector. Many enhancer sequences 
are now knovm from mammalian genes (globin, elastase, albumin, and insulin). 
Typically, however, one will use an enhancer from a eukaryotic cell virus. 
Examples include the SV40 enhancer on the late side of the replication origin, 
the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 

The disclosed vectors preferably also contain sequences necessary for 
accurate 3' end formation of both reporter and affector RNAs. In eukaryotic 
cells, this would be a polyadenylation signal. In prokaryotic cells, this would be 
a transcription terminator. 
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Method 

A, Identification of Variant Recombinases 

The disclosed method involves producing mutant recombinases and 
testing the mutant recombinases with specially designed constructs. The 
constructs contain variant recombination sites that are not recognized by non- 
mutant recombinase but will undergo recombination in the presence of a mutant 
recombinase with altered specificity. The disclosed method also involves 
determining whether a variant recombinase retains the ability to mediate 
recombination at recombination sites recognized by non-variant recombinase. 
This can be accomplished by using constructs containing recombination sites 
recognized by non- variant recombinase. Recombination at these recombination 
sites can be monitored or detected by any suitable means. It is preferred that 
recombination is detected by screening or selection based on the expression or 
lack of expression of a reporter gene. This can be accomplished by using 
constructs containing a reporter gene associated with the recombination sites 
such that the reporter gene is rearranged or deleted, or a spacer sequence 
interrupting the reporter gene is rearranged or deleted, as a result of 
recombination at the recombination sites. Recombination of such constructs 
will result in a loss of expression of the reporter gene, where the construct 
contained a functional reporter gene, or in a gain in expression of the reporter 
gene, where the construct contained a non-functional reporter gene. 

1, Production of Mutant Recombinases 

Mutant recombinases can be produced by any suitable technique. In 
general, all that is required is a method of generating a variety of recombinase 
proteins having a variety of amino acid sequences, the most preferred way of 
doing this is to mutagenize or alter nucleic acid encoding the recombinase and 
then expressing the mutant recombinases. Numerous techniques for introducing 
alterations into nucleic acid sequences are known and can be used in the 
disclosed method. For example, alterations can be made by chemical 
mutagenesis, introduction of degenerate nucleic acid fragments into the base 
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nucleic acid molecule, and low fidelity PGR. The goal of this mutagenesis or 
alteration will be the generation of a population or set of mutant recombinases 
having a variety of sequences. The broader the range of variants, the more raw 
material for the identification process. 
5 2. Identification of Variant Recombinases That Recognize Variant 

Recombination Sites 

Variant recombinases that can mediate recombination at variant 
recombination sites are identified in the disclosed method by selecting for, 
screening for, or otherwise detecting recombination of specially designed 
1 0 constructs having variant recombination sites. Recombination at variant 

recombination sites can be monitored or detected by any suitable means. It is 
preferred that recombination is detected by screening or selection based on the 

0 expressionorlackof expression of a reporter gene. This can be accomplished 
Ul by ^sing constructs containing a reporter gene associated with the variant 

% 1 5 recombination sites such that the reporter gene is rearranged or deleted, or a 

■3 spacer sequence interrupting the reporter gene is rearranged or deleted, as a 

1 II result of recombination at the recombination sites. Recombination of such 
;L constructs will resuh in a loss of expression of the reporter gene, where the 

.jp construct contained a functional reporter gene, or in a gain in expression of the 

^ J 20 reporter gene, where the construct contained a non-functional reporter gene. 

3. Identification of Variant Recombinases That Recognize Non- 
Variant Recombination Sites 

Variant recombinases that can mediate recombination at recombination 
sites recognized by non- variant recombinase (non- variant recombination sites) 
25 are identified in the disclosed method by selecting for, screening for, or 

otherwise detecting recombination of specially designed constructs having 
recombination sites recognized by non-variant recombinase. Recombination at 
these recombination sites can be monitored or detected by any suitable means. 
It is preferred that recombination is detected by screening or selection based on 
30 the expression or lack of expression of a reporter gene. This can be 
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accomplished by using constructs containing a reporter gene associated with the 
recombination sites recognized by non- variant recombinase such that the 
reporter gene is rearranged or deleted, or a spacer sequence interrupting the 
reporter gene is rearranged or deleted, as a result of recombination at the 
5 recombination sites. Recombination of such constructs will resuh in a loss of 
expression of the reporter gene, where the construct contained a functional 
reporter gene, or in a gain in expression of the reporter gene, where the construct 
contained a non- functional reporter gene. 

It is preferred that the ability of a variant recombinase to mediate 

1 0 recombination at both variant recombination sites and recombination sites 

recognized by non- variant recombinase be assessed ill the same system (such as 
a cell strain) either sequentially or simultaneously. When variant recombinases 
are tested for activity on both variant recombination sites and recombination 
sites recognized by non- variant recombinase in the same system or at the same 

1 5 time, it is preferred that two different reporter genes which can be separately 
detected or monitored be used. In this case, a first reporter gene can be 
associated with the variant recombination sites and a second reporter gene can 
be associated with recombination sites recognized by non- variant recombinase. 
B, Use of Variant Recombinases 

20 Variant recombinases produced in the disclosed method can be used for 

any purpose that unmodified recombinases can be used. The advantage is that 
the variant recombinases have a different or broader site specificity. In general, 
the disclosed variant recombinases can be used to mediate recombination of any 
nucleic acid in any setting, including in vitro, in cell Culture, and in vivo. 

25 Recombination can be obtained in single celled organisms, such as bacterial 
cells, fungal cells, yeast cells, prokaryotic cells, and archae bacterial cells, the 
cells of multicellular organisms, including plants and animals, both in the 
organism and in culture. The disclosed variant recombinases can also be used in 
combination with other recombinases (including other variant recombinases) 

30 having a different site specificity. Such combinations allow more complex 



997358vl 



28 



OMRF178 
204S7/247 



recombination schemes to be used. Examples of such schemes are discussed 
below. 

For some uses of the disclosed recombinases, first, second, and fourth 
DNA sequences comprising a first recombination site, a second recombination 
site, and a third recombination site, respectively, are introduced into cells. As 
used herein the expression "recombination site" means a nucleotide sequence at 
which a recombinase or variant recombinase can catalyze a site-specific 
recombination. 

Methods for introducing a DNA sequence into cells are known in the art. 
These methods typically include the use of a DNA vector to introduce the 
sequence into the DNA of a single or limited number of eukaryotic cells and 
then growing such cell or cells to generate a suitable population of cells. As 
used herein, the term "vector" includes plasmids, viruses, and viral vectors. 
Preferably, the DNA sequences are introduced by a plasmid capable of 
transforming a selected cell while carrying a DNA sequence. The particular 
vector which is employed to introduce the DNA sequence into a selected cell is 
not critical. 

In the present method, the recombination sitea are contacted with a 
variant recombinase, thereby producing the site specific recombination. A 
preferred means of contacting the DNA to be recombined with a variant 
recombinase is to place the DNA to be recombined into a cell expressing nucleic 
acid encoding the variant recombinase. Preferably, expression of the variant 
recombinase is under the control of a regulatory nucleotide sequence. Suitable 
regulatory nucleotide sequences are known in the art. The regulatory nucleotide 
sequence which is employed with a selected eukaryotic cell is not critical to the 
method. A partial list of suitable regulatory nucleotide sequences includes the 
long terminal repeat of Moloney sarcoma virus described by Blochlinger and 
Diggelmann, Mol Cell Bio,, 4:2929-2931 (1984); the mouse metallothionein-I 
promoter described by Pavlakis and Hamer, Proc, Natl Acad Sci USA, 80:397- 
401 (1983); the long terminal repeat of Rous sarcoma virus described by 
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Gorman et al, Proc. Natl Acad Sci USA, 79:6777-6781 (1982); and the early 
region promoter of SV40 described by Southern and Berg, J. Mol Appl Genet., 
1:327-341 (1982). 

In an embodiment where the cells are yeast, suitable regulatory 
5 nucleotide sequences include GALl, GALIO, ADHl, CYCl, and TRP5 

promoters. GALl and GAL 10 promoters are present on plasmid pBMlSO which 
is described by Johnston and Davis, Molec, Cell Biol, 4:1440 (1984). The 
ADHl promoter, also called ADCl, is present on plasmid pAAH5 which is 
described by Ammer, Methods Enzymol , 1 0 1 : 1 92 (1 983). The CYC 1 promoter 

10 is described by Stiles et ah, Cell, 25:277 (1981), The TRP5 promoter is 

described by Zalkin and Yanofsky, J. Biol Chem,, 257:1491 (1982). Preferably, 
the regulatory nucleotide sequence is a GALl promoter. 

In one embodiment where the cell is yeast, the first, second, and 
optionally, third and fourth DNA sequences are introduced into one strain of 

15 yeast. Alternatively, the DNA sequences are introduced into two different 

strains of yeast of opposite mating types which are subsequently mated to form a 
single strain having all three or four DNA sequences. Preferably, the plasmid 
contains either (1) a nucleotide sequence of DNA homologous to a resident 
yeast sequence to permit integration into the yeast DNA by the yeast's 

20 recombination system or (2) a nucleotide sequence of DNA which permits 
autonomous replication in yeast. One nucleotide sequence which permits 
autonomous replication in yeast is an ARS sequence described by Stinchcomb et 
al, Nature, 282:39 (1979). A partial list of plasmids capable of transforming 
yeast includes YIPS, YRP17 and YEP24. These plasmids are disclosed and 

25 described by Botstein and Davis, The Molecular Biology of the Yeast 

Saccharomyces, Metabohsm and Gene Expression (ed. Strathern et al), (Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1982), at page 607. 

Since most recombination sites are asymmetrical nucleotide sequences, 
two recombination sites on the same DNA molecule can have the same or 

30 opposite orientations with respect to each other. Recombinations between 
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recombination sites in the same orientation result in a deletion of the DNA 
segment located between the two recombination sites and a connection between 
the resulting ends of the original DNA molecule. The deleted DNA segment 
forms a circular molecule of DNA. The original DNA molecule and the 
5 resulting circular molecule each contain a single recombination site (see Figure 
2). Recombination between recombination sites in opposite orientations on the 
same DNA molecule result in an inversion of the nucleotide sequence of the 
DNA segment located between the two recombination sites (see Figure 2). In 
addition, reciprocal exchange of DNA segments proximate to recombination 

10 sites located on two different DNA molecules can occur. All of these 

recombination events are catalyzed by recombinases, including the disclosed 
variants and wild type recombinases. 

Recombination using the disclosed variant recombinases can be used in 
vitro to produce site-specific recombination of nucleic acid molecules. This is 

1 5 useful for a wide variety of manipulations that currently employ wild type 
recombinases or involve traditional restriction enzyme cleavage followed by 
ligation. Examples include recombination of hbraries of DNA fragments into 
vectors or in desired structures, and labeling of DNA via recombination. 
Recombined DNA formed by in vitro recombination pan then be introduced into 

20 cells. For example, constructs formed in vitro can be introduced into cells to 

resolve the structures formed in vitro or to select active structures. In particular, 
large concatemers of subject DNA and spacer/vector fragments can be made, 
introduced into cells, and circularized into vector units in the cells. Such 
recombination could also be performed in vitro if desired. 

25 The disclosed variant recombinases can be used to label DNA by 

recombining a DNA molecule of interest with a labeled DNA molecule. Use of 
a recombinase for labeling is advantageous since it involves fewer steps than 
traditional labeling via DNA synthesis or ligation. These considerations are 
particularly important when large DNA molecules (over 20 kb) are to be labeled 

30 since such large molecules will fragment more the more they are manipulated. 
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Recombination mediated by the disclosed variant recombinases and 
variant recombination sites can be used to manipulate a host cell genome as 
desired and simultaneously introduce a marker gene flanked by the recognition 
sites of a second recombinase. After selection, leading to an accumulation of 
5 cells carrying the desired genomic alteration, one could simply remove the 
marker gene by expression of the second site-specific recombinase, A large 
number of recombinases suitable for this purpose exists in nature, including X 
Integrase (Int), yeast Flp, etc. (Nunes-Doby et al., NucL Acids Res., 26:391-406 
(1998)). Variant recombinases having different site specificity can also be used, 

1 0 Since the disclosed variant recombinases recognize both wild type 

recombination sites and variant recombination sites that are not recognized by 
wild type recombinase, wild type recombinase and variant recombinases can be 
used to mediate sequential recombination between nucleic acids containing a 
combination of wild type recombination sites and variant recombination sites. 

15 For example, generation of knockout animals and plants can be made more 
efficient by using a structure wild type recombination site-selectable marker- 
wild type recombination site-endogenous gene-variant recombination site (rather 
than the conventional wild type recombination site-selectable marker- wild type 
recombination site-endogenous gene-wild type recombination site). Such a 

20 structure allows the selectable marker to be removed by the action of wild type 
recombinase without disturbing the gene since wild type recombinase will not 
recognize the variant recombination site to any significant degree. The 
endogenous gene can then be removed later by the action of a variant 
recombinase since the disclosed variant recombinases recognize both wild type 

25 and variant recombination sites. 

In a preferred embodiment of the disclosed method, the first and second 
DNA sequences are introduced into cells connected by a pre-selected DNA 
segment. The segment can be a gene or any other sequence of 
deoxyribonucleotides of homologous, heterologous or synthetic origin. 

30 Preferably, the pre-selected DNA segment is a gene for a structural protein, an 
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enzyme, or a regulatory molecule. If the first and second recombination sites 
have the same orientation, activation of the regulatory nucleotide sequence 
produces a deletion of the pre-selected DNA segment. If the first and second 
recombination sites have opposite orientation, activation of the regulatory 
5 nucleotide sequence produces an inversion of the nucleotide sequence of the 
pre-selected DNA segment. 

If a fourth DNA sequence (containing the third recombination site) is 
also introduced into cells, it is preferred that the second and fourth DNA 
sequences be introduced into cells connected by a second pre-selected DNA 

1 0 segment. The second segment can be a gene or any other sequence of 
deoxyribonucleotides of homologous, heterologous or synthetic origin. 
Preferably, the second pre-selected DNA segment is a gene for a structural 
protein, an enzyme, or a regulatory molecule. If the second and third 
recombination sites have the same orientation, activation of the regulatory 

15 nucleotide sequence produces a deletion of the second pre-selected DNA 

segment. If the second and third recombination sites have opposite orientation, 
activation of the regulatory nucleotide sequence produces an inversion of the 
nucleotide sequence of the second pre-selected DNA segment. 

Combinations of wild type and variant recombination sites, and 

20 combinations of different orientations of the recombination sites, in DNA 

introduced into cells can multiply recombination options. For example, if the 
first and second recombination sites are wild type recombination sites and the 
third recombination site is a variant recombination site (all in the same 
orientation) then wild type recombinase can produce a deletion of the first pre- 

25 selected DNA segment (but not the second) and a variant recombinase can 
produce a deletion of the first, second, or both pre-selected DNA segments. 
This arrangement allows sequential deletion of the first and second pre-selected 
DNA segments. 

If the first and second recombination sites are wild type recombination 
30 sites and the third recombination site is a variant recombination site, and the first 
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recombination site has the opposite orientation from the second and third 
recombination sites (which, of course, have the same orientation) then wild type 
recombinase can produce an inversion of the first pre-selected DNA segment 
and a variant recombinase can produce a deletion of the second pre-selected 
5 DNA segment (and/or produce an inversion of the first pre-selected DNA 

segment or the entire section spanning the first, second, and third recombination 
sites), 

If the first and third recombination sites are wild type recombination 
sites and the second recombination site is a variant recombination site, and the 

1 0 second recombination site has the opposite orientation from the first and third 
recombination sites (which, of course, have the same orientation) then wild type 
recombinase can produce a deletion of the entire section spanning the first, 
second, and third recombination sites, and a Cre variant can produce an 
inversion of the first, second, or both pre-selected DNA segments. 

1 5 If the first and third recombination sites are wild type recombination 

sites and the second recombination site is a variant recombination site, and the 
first recombination site has the opposite orientation from the second and third 
recombination sites (which, of course, have the same orientation) then wild type 
recombinase can produce an inversion of the entire section spanning the first, 

20 second, and third recombination sites, and a variant recombinase can produce a 
deletion of the second pre-selected DNA segments and an inversion of the first 
pre-selected DNA segment. 

Many more combinations of wild type and variant recombination sites 
and or recombination site orientations are possible. For example, the variant 

25 recombinase can also be used with a different variant recombinase having a 

different site specificity rather than wild type recombinase. The above examples 
illustrate the general principles involved in designing specific recombinations 
that may be desired. It should be understood that the above combinations of 
recombination sites can be extended to the use of more recombination sites (that 

30 is more than three) and more intervening, pre-selected DNA segments. 
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For some uses of the disclosed recombinases, first, second, and fourth 
DNA sequences comprising a first lox site, a second lox site, and a third lox site, 
respectively, are introduced into cells. As used herein the expression "lox site" 
means a nucleotide sequence at which the gene product of the ere gene, referred 
5 to herein as Cre, and/or the disclosed Cre variants, can catalyze a site-specific 
recombination. LoxP site is a 34 base pair nucleotide sequence (Figure 1) which 
can be directly synthesized or isolated from bacteriophage PI by methods 
known in the art. The Lox P site is an example of a wild type lox site. One 
method for isolating a LoxP site from bacteriophage PI is disclosed by Hoess et 
10 al., Proc. Natl Acad ScL USA, 79:3398-3402 (1982). The LoxP site consists of 
two 13 base pair inverted repeats separated by an 8 base pair spacer region. The 
nucleotide sequences of the insert repeats and the spacer region are as follows. 
ATAACTTCGTATAATGTATGCTATACGAAGTTAT 
Other wild type lox sites include LoxB, LoxL and LoxR sites which are 
15 nucleotide sequences isolated from E. coli. These sequences are disclosed and 
described by Hoess et al., Proc. Natl Acad. Scl USA, 79:3398-3402 (1982). 
Preferred wild type lox sites are LoxP or LoxC2. Lox sites can also be produced 
by a variety of synthetic techniques which are known in the art. For example, 
synthetic techniques for producing lox sites are disclosed by Ito et al., Nuc. Acid 
20 Res., 10:1755 (1982) and Ogilvie et al., Science, 214:270 (1981). 

The gene product of the cre gene is a recombinase herein designated 
"Cre" which effects site-specific recombination of Dl^fA at lox sites. As used 
herein, the expression "cre gene" means a nucleotide sequence which codes for a 
gene product which effects site-specific recombination of DNA in cells at lox 
25 sites. One cre gene (the wild type cre gene) can be isolated from bacteriophage 
PI by methods known in the art. One method for isolating a cre gene from 
bacteriophage PI is disclosed by Abremski et al., Cell, 32; 1301-13 1 1 (1983), the 
disclosure of which is incorporated herein by reference. 

Genes engineered into cells for producing a foreign protein are often 
30 placed under the control of a highly active promoter. The activity of the 
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promoter can result in an overproduction of the protein which interferes with the 
growth of the engineered cell. This overproduction of the protein can make it 
difficult to grow the engineered cell in sufficient quantity to make protein 
production economically feasible. The present invention provides a method 
5 whereby engineered cells can be grown to a desired density prior to expressing 
the engineered gene. The engineered gene is expressed, as desired, by activating 
a regulatory nucleotide sequence responsible for controlling expression of DNA 
encoding a variant recombinase. Methods of controlling the expression of an 
engineered gene include the following: 

10 (1) A DNA segment flanked by recombination sites in the same 

orientation is introduced into DNA in a cell between a promoter and an 
engineered gene to render the promoter incapable of expressing the gene. A 
second DNA sequence comprising a regulatory nucleotide sequence and DNA 
encoding a variant recombinase is also introduced in the DNA. After the 

15 engineered cells are grown to a desired density, the regulatory nucleotide 

sequence is activated thereby effecting expression of the variant recombinase 
and producing a deletion of the DNA segment. The engineered gene would then 
be expressed. 

(2) A gene for a regulatory molecule flanked by recombination sites in 
20 the same orientation is introduced into DNA in a cell The regulatory molecule 

inhibits expression of an engineered gene. A second DNA sequence comprising 
a regulatory nucleotide sequence and DNA encoding a variant recombinase is 
also introduced into the DNA. After the engineered cells are grown to a desired 
density, the regulatory nucleotide sequence is activated thereby effecting 
25 expression of the variant recombinase and producing a deletion of the gene for 
the regulatory molecule. The engineered gene would then be expressed. 

(3) An engineered gene lacking a promoter and flanked by two 
recombination sites in opposite orientations is introduced into DNA in a cell 
such that the 3' end of the gene lies adjacent to the traliscription start site of a 

30 regulatory nucleotide sequence. A second DNA sequence comprising a 
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regulatory nucleotide sequence and DNA encoding a variant recombinase is also 
introduced into the DNA. Since the engineered gene would be transcribed in the 
antisense direction, no engineered protein would be produced. After the 
engineered cell is grown to a desired density, the regulatory nucleotide sequence 
5 is activated thereby effecting expression of the variant recombinase and 

producing an inversion of the desired gene. The engineered gene could then be 
transcribed in the proper direction and expressed. 

Numerous methods and techniques have been developed for the use of 
Cre recombinase and other, similar recombinases such as FLP. The disclosed 

10 variant recombinases can also be used in any of these'methods. Adaptation of 
these methods to the use of the disclosed variant recombinases is 
straightforward. Generally, all that is required is substitution of a variant 
recombinase (or a gene expressing a variant recombinase) for the original 
recombinase (or recombinase gene) and, if appropriate, substitution of variant 

15 (or wild type) recombination sites for the original recombination sites used in 
the method. 

Examples of methods involving wild type recombinases and wild type 
recombination sites that can be adapted for use with the disclosed variant 
recombinases and recombination sites include recombination of DNA in phage 

20 packaging systems, recombination of DNA to form phage display libraries (for 
example, Fisch et aL, Proc Natl Acad Sci USA 93(15):7761-6 (1996), and 
Waterhouse et al. Nucleic Acids Res 21(9):2265-6 (1993), and other uses (for 
example, (Sauer et al, Proc. Natl Acad Sci, USA 84: 9108-91 12 (1987), 
MuUins et al. Nucleic Acids Res 25(12):2539-40 (1997), Aoki et al, Mol Med 

25 5(4):224-31 (1999)). 

Other examples of specific methods in which the disclosed variant 
recombinases can be used or substituted include methods disclosed in U.S. 
Patent No. 5,888,981, U.S. Patent No. 5,888,732, U.S. Patent No. 5,885,836, 
U.S. Patent No. 5,885,793, U.S. Patent No. 5,885,779, U.S. Patent No. 

30 5,885,776, U.S. Patent No, 5,882,893, U.S. Patent No. 5,882,888, U.S. Patent 
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No. 5,877,400, U.S. Patent No. 5,871,907, U.S. Patent No. 5,866,755, U.S. 
Patent No. 5,866,361, U.S. Patent No. 5,859,310, U.S. Patent No. 5,858,657, 
U.S. Patent No. 5,854,067, U.S. Patent No. 5,851,808, U.S. Patent No. 
5,849,995, U.S. Patent No. 5,849,989, U.S. Patent No. 5,849,708, U.S. Patent 
5 No. 5,849,572, U.S. Patent No. 5,849,571, U.S. Patent No. 5,849,553, U.S. 
Patent No. 5,844,079, U.S. Patent No. 5,843,744, U.S. Patent No. 5,843,742, 
U.S. Patent No. 5,843,694, U.S. Patent No. 5,840,540, U.S. Patent No. 
5,837,844, U.S. Patent No. 5,837,242, U.S. Patent No. 5,834,202, U.S. Patent 
No. 5,830,729, U.S. Patent No. 5,830,698, U.S. Patent No. 5,830,461, U.S. 

10 Patent No. 5,817,492, U.S. Patent No. 5,814,618, U.S. Patent No. 5,814,500, 
U.S. Patent No. 5,814,300, U.S. Patent No. 5,807,995, U.S. Patent No. 
5,807,708, U.S. Patent No. 5,801,030, U.S. Patent No. 5,800,998, U.S. Patent 
No. 5,795,734, U.S. Patent No. 5,795,726, U.S. Patent No. 5,792,833, U.S. 
Patent No. 5,792,632, U.S. Patent No. 5,789,156, U.S. Patent No. 5,777,194, 

1 5 U.S. Patent No. 5,776,449, U.S. Patent No. 5,773,697, U.S. Patent No. 

5,770,384, U.S. Patent No. 5,767,376, U.S. Patent No. 5,763,240, U.S. Patent 
No. 5,756,671, U.S. Patent No. 5,744,343, U.S. Patent No. 5,744,336, U.S. 
Patent No. 5,736,377, U.S. Patent No. 5,733,744, U.S. Patent No. 5,733,743, 
U.S. Patent No. 5,733,733, U.S. Patent No. 5,731,182, U.S. Patent No. 

20 5,723,765, U.S. Patent No. 5,723,333, U.S. Patent No. 5,723,287, U.S. Patent 
No. 5,721,367, U.S. Patent No. 5,721,118, U.S. Patent No. 5,700,470, U.S. 
Patent No. 5,686,595, U.S. Patent No. 5,679,523, U.S. Patent No. 5,677,177, 
U.S. Patent No. 5,658,772, U.S. Patent No. 5,656,438, U.S. Patent No. 
5,654,182, U.S. Patent No. 5,654,168, U.S. Patent No. 5,650,491, U.S. Patent 

25 No. 5,650,308, U.S. Patent No. 5,650,298, U.S. Patent No. 5,643,727, U.S. 
Patent No. 5,641,866, U.S. Patent No. 5,641,748, U.S. Patent No. 5,639,726, 
U.S. Patent No. 5,635,381, U.S. Patent No. 5,629,179, U.S. Patent No. 
5,629,159, U.S. Patent No. 5,614,389, U.S. Patent No. 5,612,205, U.S. Patent 
No. 5,596,089, U.S. Patent No. 5,591,609, U.S. Patent No. 5,589,362, U.S. 

30 Patent No. 5,539,094, U.S. Patent No. 5,530,191, U.S. Patent No. 5,527,695, 
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U.S. Patent No, 5,510,099, U.S. Patent No. 5,478,731, U.S. Patent No, 
5,441,884, U.S. Patent No. 5,434,066, U.S. Patent No. 5,378,618, U.S. Patent 
No. 5,354,668, U.S. Patent No. 5,334,515, U.S. Patent No. 5,300,431, and U.S. 
Patent No. 4,959,317. 
5 1. Use of Variants Recombinases in Plants and Plant Cells 

Methods for introducing a DNA sequence into plant cells are known in 
the art. Nucleic acids can generally be introduced into plant protoplasts, with or 
without the aid of electroporation, polyethylene glycol, or other processes 
known to alter membrane permeability. Nucleic acid constructs can also be 

10 introduced into plants using vectors comprising part of the Ti- or Ri-plasmid, a 
plant virus, or an autonomously replicating sequence^ Nucleic acid constructs 
can also be introduced into plants by microinjection or by high- velocity 
microprojectiles, also termed "particle bombardment" or "biolistics" (Sanford, J. 
C, Tibtech 6: 299 (1988)), directly into various plant parts. The preferred 

1 5 means of introducing a nucleic acid fragment into plant cells involves the use of 
A. tumefaciens containing the nucleic acid fragment between T~DNA borders 
either on a disarmed Ti-plasmid (that is, a Ti-plasmid from which the genes for 
tumorigenicity have been deleted) or in a binary vector in trans to a disarmed Ti- 
plasmid. The Agrobacterium can be used to transform plants by inoculation of 

20 tissue explants, such as stems, roots, or leaf discs, by co-cultivation with plant 
protoplasts, or by inoculation of seeds or wounded plant parts. 

Foreign genes can be introduced into a wide range of crop species. 
Thus, the disclosed variant recombinases and method are applicable to a broad 
range of agronomically or horticulturally useful plants. The particular method 

25 which is employed to introduce the DNA sequence into a selected plant cell is 
not critical. In a preferred embodiment, DNA sequences are introduced into 
plant cells by co-cultivation of leaf discs with J. tumefaciens essentially as 
described by Horsch et al., Science, 227: 1229-1231 (1985) omitting the nurse 
cultures. 
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In the present method, the recombination sites are contacted with a 
variant recombinase, thereby producing the site specific recombination. In one 
embodiment, a variant recombinase, or messenger RNA encoding a variant 
recombinase, is introduced into the cells directly by micro-injection, biolistics, 
5 or other protein or RNA introduction procedure. In a^preferred embodiment, 
DNA encoding the variant recombinase is introduced into the plant cell under 
the control of a promoter that is active in plant cells. Suitable regulatory 
nucleotide sequences are known in the art. The promoter which is employed 
with a selected plant cell is not critical to the method of the invention. A partial 

1 0 list of suitable promoters include the 35S promoter of cauliflower mosaic virus 
described by Odell et al, Nature, 313: 810-812 (1985); the promoter from the 
nopaUne synthase gene of J. tumefaciens described by Depicker et al, J. ofMol 
Appl Genet, 1 : 561-573 (1982); the promoter from a Rubisco small subunit 
gene described by Mazur and Chui, Nucleic Acids Research 13: 2373-2386 

1 5 (1 985); the 1' or 2' promoter from the TR-DNA of A. tumefaciens described by 
Velten et al., EMBO J. 12: 2723-2730 (1984); the promoter of a chlorophyll a/b 
binding protein gene described by Dunsmuir et al, J. Mol Appl Genet, 2: 285- 
300 (1983); the promoter of a soybean seed storage pyotein gene described by 
Chen et al, Proc, Natl Acad Scl USA, 83: 8560-8564 (1986); and the promoter 

20 from the wheat EM gene described by Marcotte et al., Nature 335: 454-457 

(1988). Variant recombinases can be expressed throughout the plant generally 
in all cells at all stages of development, or expression of variant recombinases 
can be more specifically controlled through the use of promoters or regulatory 
nucleotide sequences having limited expression characteristics. Variant 

25 recombinases can be expressed in a tissue specific manner, for example only in 
roots, leaves, or certain flower parts. Variant recombinases can be expressed in 
a developmentally specific time period, for example only during seed formation 
or during reproductive cell formation. Expression of variant recombinases can 
also be placed under the control of a promoter that can be regulated by 

30 application of an inducer. In this case expression is off or very low until the 
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external inducer is applied. Promoters active in plant cells have been described 
that are inducible by heat shock (Gurley et al, Mol Cell BioL 6: 559-565 
(1986)), ethylene (Bfoglie et al.. Plant Cell 1 : 599-607 (1989)), auxin (Hagan 
and Guilfoyle, Mol Cell BioL 5: 1197-1203 (1985)), abscisic acid (Marcotte et 
5 al., Nature 335: 454-457 (1988)), salicylic acid (EPO 332104A2 and EPO 
337532A1), and substituted benzenesulfonamide safeners (WO 90/11361). 
Control of expression of variant recombinases by the safener-inducible promoter 
2-2, or its derivatives, allows the expression to be turned on only when the 
inducing chemical is applied and not in response to environmental or 

1 0 phytohormonal stimuli. Thus expression can be initiated at any desired time in 
the plant life cycle. Preferably, the regulatory nucleotide sequence is a 35S 
promoter or a 2-2 promoter. The above techniques and materials can also be 
used to express wild type recombinase in plant cells if required by the particular 
recombination pattern to be accomplished. 

15 One appHcation of the disclosed variant recombinases is in controlling 

male fertility in a method for producing hybrid crops. Hybridization of a crop 
involves the crossing of two different Unes to produce hybrid seed from which 
the crop plants are grown. Hybrid crops are superior in that more of the desired 
traits can be introduced into the production plants. For instance, quality traits 

20 such as oil content, herbicide resistance, disease resistance, adaptability to 

environmental conditions, and the hke, can be hybridized in offspring so that the 
latter are invested with the most desirable traits of its parents. In addition, 
progeny from a hybrid cross may possess new quahties resulting from the 
combination of the two parental types, such as yield enhancement resulting from 

25 the phenomenon known as heterosis. Controlled cross-fertilization to produce 
hybrid seeds has been difficult to achieve commercially due to competing self- 
fertilization, which occurs in most crop plants. 

Hybrid seed production is typically performed by one of the following 
means: (a) mechanically removing or covering the male organs to prevent self- 

30 fertilization followed by exposing the male-disabled plants to plants with male 
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organs that contain the trait(s) desired for crossing; (b) growing genetically 
male-sterile plants in the presence of plants with fertile male organs that contain 
the trait that is desired for crossing; or (c) treating plants with chemical 
hybridizing agents (CHA) that selectively sterilize male organs followed by 
5 exposing the male-disabled plants to plants with fertile male organs that contain 
the trait that is desired for crossing. Some disadvantages to each of these 
methods include: (a) applicability only to a few crops, such as corn, where the 
male and female organs are well separated; and it is labor intensive and costly; 
(b) genetically male sterile lines are cumbersome to maintain, requiring crosses 

10 with restorer lines; (c) all CHAs exhibit some degree of general phytotoxicity 
and female fertility reduction. Also CHAs often show different degrees of 
effectiveness toward different crop species, or even toward different varieties 
within the same species. 

A molecular genetic approach to hybrid crop production applicable to a 

15 wide range of crops and involves genetic male sterility is described in EPA 89- 
344029. This system involves the introduction of a cell disruption gene that is 
expressed only in the tapetal tissue of anthers thereby destroying the developing 
pollen. The resulting genetically male sterile plants serve as the female parents 
in the cross to produce hybrid seed. This system could be highly effective and 

20 desirable. However one disadvantage is that since the male sterile parent is 

heterozygous for the sterility gene which acts as a dominant trait, only 50% of 
the plants grown from the hybrid seed are fertile, the rest retain the sterility 
gene. This situation will result in reduced pollen shed in the production field 
which may lead to reduced seed set and yield. Addition of recombinase 

25 technology to this hybrid scheme allows restoration of fertility to a much higher 
percentage of plants in the production field, as well as elimination of the cell 
disruption gene. Placing the male sterility gene between recombination sites 
allows it to be deleted following introduction of a variant recombinase into the 
hybrid from the male parent. 
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Another application of the disclosed variant recombinases is in making 
seedless produce. Seedlessness is desirable in consumed produce for 
convenience and taste. Currently "seedless" watermelon is sold that actually 
contains some developed seed and a large number of immature seed that varies 
5 in size up to that of fully mature seed. To produce these watermelon first a 
hybrid cross is made between a tetraploid maternal parent and a diploid 
pollinator. The resuhing triploid seed produces self-infertile plants that are 
crossed with a diploid pollinator to produce seedless fruit (Kihara, Proc, Soc. 
Hon Sci, 58: 217-230, (1951)). This production scheme suffers the following 

10 problems: (i) Creating a tetraploid plant, which is accomplished by a 

chromosome duplication method, is difficuh. Also the number of seeds per fruit 
on this tetraploid plant must be low since this has a positive correlation with 
seed number in the final product (Andrus, Production of Seedless Watermelons, 
USDA Tech. Bull. No. 1425 (1971)); (ii) good combining ability of the diploid 

15 poUinator and the tetraploid plant is difficult to achieve (Henderson, 1 Amer, 

Soc. Hon Sci., 102: 293-297 (1977)); (iii) the triploid seeds are much inferior to 
regular diploid seeds in vigor and germinability (Maynard, Hon Sci,, 24: 603- 
604 (1989)). These problems, together with incomplete seedlessness in the final 
product, make the development of seedless watermelon slow and difficult. This 

20 ploidy-based approach to seedlessness is possible only in those few species 
where unusual euploidy plants (tetraploid and triploid for watermelon, for 
example) are viable. 

A molecular genetic approach to seedlessness involving the disclosed 
variant recombinases is much more efficient, resulting in a more reliably 

25 seedless product and does not involve changes in ploidy. Thus it is more 

generally applicable to a wider range of species. A recombination site/poly A- 
inactivated cell disruption gene regulated by a seed-specific promoter is 
introduced into a plant. When this plant is crossed to a plant expressing a 
variant recombinase, the disruption gene is activated and expressed in the seed, 

30 thereby disrupting seed development. The certainty of endosperm failure 
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(caused by the cell disruption gene product) leading to the abortion of the whole 
seed is very high. In most dicots, the endosperm supplies the nutrients needed 
for early embryo development. Endosperm abortion invariably leads to seed 
abortion (Brink and Cooper, Bot. Rev, 8: 423-541 (1947)). 
5 The seed-specific promoter used can be selected from the group of 

promoters known to direct expression in the embryo and/or the endosperm of the 
developing seed, most desirably in the endosperm. Examples of seed-specific 
promoters include but are not limited to the promoters of seed storage proteins. 
The seed storage proteins are strictly regulated, being expressed almost 
10 exclusively in seeds in a highly tissue-specific and stage-specific manner 

(Higgins et a\.,Ann. Rev. Plant Physiol. 35: 191-221 (1984); Goldberg et al., 
Cell 56: 149-160 (1989)). Also, different seed storage proteins may be 
expressed at different stages of seed development and in different parts of the 
seed. 

1 5 There are numerous examples of seed-specific expression of seed storage 

protein genes in transgenic dicotyledonous plants. These include genes from 
dicotyledonous plants for bean p-phaseolin (Sengupta-Goplalan et al., Proc. 
Natl. Acad. Sci. USA 82: 3320-3324 (1985) and Hoffman et al. Plant Mol. Biol. 
11: 717-729 (1988)), bean lectin (Voelker et al., EMB0J6: 3571-3577 (1987), 

20 soybean lectin (Ocamuro et al., Proc. Natl. Acad. Sci. USA 83: 8240-8344 

(1986)). soybean kunltz trypsin inhibitor (Perez-Grau and Goldberg Plant Cell 1 : 
1095-1 109 (1989)), soybean p-conglycinin (Beachy et al., EMB0J4: 3047- 
3053 (1985), Barker et al., Proc. Natl Acad Sci. 85: 458-462 (1988), Chen et 
al, EMBOJl: 297-302 (1988), Chen et al.. Dev. Genet, 10: 112-122 (1989), 

25 Naito et al. Plant Mol. Biol. 1 1 : 683-695 (1988)), pea vicillin (Higgins et al.. 
Plant Mol. Biol. 11: 109-123 (1988)), pea convicilUin (Newbigin et al., Planta 
1 80: 461 (1990)), pea legumin (Shirsat et al., Mol Gen. Genetics 215: 326 
(1989)), rapeseed napin (Radke et al., Theor. Appl. Genet. 75: 685-694 (1988)), 
as well as genes from monocotyledonous plants such as for maize 15-kd zein 

30 (Hoffinan et al., EMB0J6: 3213-3221 (1987)), barley P-hordein (Marris et al., 
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Plant Mol Biol 10: 359-366 (1988)), and wheat glutenin (Colot et al, EMBOJ 
6: 3559-3564 (1987)). Moreover, promoters of seed-specific genes 
operably linked to heterologous coding regions in chimeric gene constructions 
also maintain their temporal and spatial expression pattern in transgenic plants. 

5 Such examples include Arabidopsis thaliana 2S seed storage protein gene 
promoter to express enkephalin peptides in Arabidopsis and Brassica napus 
seeds (Vandekerckhove et al., Bio/Technology 7: 929-932 (1989)), bean lectin 
and bean p-phaseolin promoters to express luciferase (Riggs et al., Plant Sci. 63 : 
47-57 (1989)), and wheat glutenin promoters to express chloramphenicol acetyl 

10 transferase (Colot et al, EMBO J. 6: 3559-3564 (1987)). Promoters highly 

expressed early in endosperm development are most effective in this application. 
Of particular interest is the promoter from the a subunit of the soybean p- 
conglycinin gene (Walling et al, Proc. Natl Acad, Sci USA 83: 2123-2127 
(1986)) which is expressed early in seed development in the endosperm and the 

15 embryo. 

The cell disruption gene used can be selected from a group of genes 
encoding products that disrupt normal functioning of cells. There are many 
proteins that are toxic to cells when expressed in an unnatural situation. 
Examples include the genes for the restriction enzyme EcoRI (Barnes and Rine, 

20 Proc, Natl Acad, Set USA 82: 1354-1358 (1985)), diphtheria toxin A 

(Yamaizumi et al., Cell 15: 245-250 (1987)), streptavidin (Sano and Cantor, 
Proc, Natl Acad Sci. USA 87: 142-146 (1990)), andbamase (Paddon and 
Hartley, Gene 53: 11-19 (1987)). Most preferred for this system is the coding 
region of barnase which has been shown to be highly effective in disrupting the 

25 function of plant cells (EPA 89-344029). 

A highly desirable seedless system is one in which fully fertile Fl seed 
develops, that can then be grown into plants that produce only seedless fruit. 
This system is economically favorable in that for eacfi cross pollination, a large 
number of seedless fruits result: the number of Fl seed from one cross X the 

30 number of fruits produced on an Fl plant. Also incorporated in this scheme are 
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the advantages of growing a hybrid crop, including the combining of more 
valuable traits and hybrid vigor. This is accomplished in the same manner as 
described above except that the recombination site/poly A-inactivated disruption 
gene is expressed from a seed maternal tissue (seed coat or nucellus)-specific 
5 promoter. For example, the seed coat is the outgrowth of the integuments, a 

strictly maternal tissue. Therefore the hybrid cross that brings the recombination 
site/polyA-inactivated disruption gene together with the recombinase gene does 
not involve this seed coat tissue. The seed coat of the Fl seed has either 
recombination sites or recombinase, depending on which is used as the female 

10 parent, and thus Fl seed develop normally. After the Fl seed gives rise to a 

fruit-bearing Fl plant, all vegetative cells (including seed coat cells) inherit both 
recombination sites and recombinase from the embryo. Thus the seed coat of 
the Fl plant has an activated cell disruption gene. 

The seed coat is an essential tissue for seed development and viability. 

1 5 When the seed is fully matured, the seed coat serves as a protective layer to 
inner parts of the seed. During seed development, the seed coat is a vital 
nutrient-importing tissue for the developing embryo. The seed is nutritionally 
"parasitic" to the mother plant. All raw materials necessary for seed growth 
must be imported. In seeds of dicotyledonous plants, the vascular tissue enters 

20 the seed through the funiculus and then anastamoses in the seed coat tissue. 
There is no vascular tissue connection or plasmodesmata linkage between the 
seed coat and the embryo. Therefore, all nutrient solutes delivered into the 
developing seed must be unloaded inside the seed coat and then move by 
diffusion to the embryo. Techniques have been developed to study the nutrient 

25 composition in the seed coat (Hsu et al., Plant Physiol 75: 181 (1984); Thome 
& Rainbird, Plant Physiol 72: 268 (1983); Patrick, 1 Plant Physiol 115: 297 
(1984); Wolswinlcel & Ammerlaan, 1 Exp. Bot, 36: 359 (1985)), and also the 
detailed cellular mechanisms of solute unloading (Offler & Patrick, Aust. J, 
Plant Physiol 1 1 : 79 (1984); Patrick, Physiol Plant 78: 298 (1990)). It is 
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obvious that the destruction of this vital nutrient-flmneUing tissue causes seed 
abortion. 

The disclosed tissue-specific and site-directeADNA recombination can 
be used to obtain seedless fruit production. This method is useful for the 

5 production of seedless watermelon, for example. A combination of gene 
expression specific for maternally inherited seed tissue and the disclosed 
recombinase system can be used for the production of seedless watermelon. The 
system can be universally applied to any horticultural crop in which the presence 
of seeds is undesirable and difficult to be eliminated through conventional 

10 breeding methods. The system also allows the normal production of Fl seeds. 
The ability to maintain heterosis is an advantage of producing F2 seedless fruits. 

The existing production of seedless watermelon indicates that seed 
development is not essential for the watermelon fruit development. However, 
conventional production of seedless watermelon using the ploidy imbalance 

1 5 trick has never been very popular due to the difficulty of overcoming the yield 
and production problems. Creating and maintaining the tetraploid (4n) female 
germline, and producing the triploid (3n) seeds have made the seed cost high. 
Cross-pollination is needed for the production of triploid seeds (4n X 2n) and 
seedless fruits (3n X 2n). Also triploid seed germination is usually poor due to 

20 ploidy imbalance. 

The present approach eliminates the dependence on polyploid germlines 
and provides an efficient system for producing seedless fruit. The products of 
double fertilization of higher plants are the embryo and endosperm. The seed 
coat (including the integumentary tapetum) and nucellus (the tissue 

25 encompassing the embryo sac) are the remaining seed tissues that are maternally 
inherited. In addition to general protection, the seed coat and nucellus also play 
an important role in importing nutrients into the developing embryo and 
endosperm. Seed development will be aborted if this vital nutrient-importing 
mechanism of the seed coat/nucellus is debilitated. Tjiis will be accomplished 

30 by using the recombinase system to activate a cell-damaging gene only in these 
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tissues. Controlling the gene activation in a maternal tissue-specific manner 
allows production of normal Fl seed, but abortion of F2 seed, A seed coat or 
nucellus promoter is coupled to a tissue-destructive (lethal) gene in order to 
prevent seeds from forming. The destructive gene is inactive in the seed parent 
5 due to the presence of a blocking transcription terminator. The terminator is 
flanked by recombination sites for subsequent excision by a recombinase- 
mediated recombination event. Expression of the recombinase is also controlled 
by the seed coat/nucellus-specific promoter. When plants carrying the separate 
recombinase and recombination site constructs are crossed, the Fl seed will be 

10 viable because seed coat/nucellus is maternal tissue, and in that tissue 

recombinase and recombination sites are not combined. When the Fl seed is 
used as planting seed, the self-pollinated or out-crossed plants will produce 
seedless fruits or vegetables, since in seed coat/nucellus tissues recombinase and 
recombination sites are combined, and the lethal gene is activated, 

1 5 2. Use of Variant Recombinases for Phage Packaging 

The disclosed variant recombinases can also be used to aid in phage 
packaging. The cloning system described herein utilizes a headful in vitro 
packaging system to clone foreign DNA fragments as large as 95 kb which 
permits the isolation of DNA fragments that are at least twice the size of those 

20 that can be obtained by lambda cosmid cloning. This increased cloning capacity 
has the following utility: 

(1) Genes in the 45-95 kb size range and, more particularly, in the 70-95 
kb size range can now be directly cloned and genes in the 25-45 kb size range 
can be cloned more easily. 

25 (2) Chromosomal "walking" and "jumping" techniques can be speeded 

up by a factor of at least two and should be more accurate because of the 
reduced number of contiguous segments that need to be linlced together. 

(3) The cloning system of the invention is useful as a means for the 
delivery of DNA efficiently to bacteria which otherwise do not take up DNA 

3 0 from solution well . 
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Specifically, the headful packaging system of this invention for cloning 
foreign DNA fragments as large as 95 kb comprises: 

(a) modifying vector DNA by inserting a stuffer fragment into a blunt 
end producing site which is proximal to a pac site; 
5 (b) digesting the product of step (a) to produce two vector arms each of 

which contains (i) a blunt end, (ii) another end which is compatible with the 
foreign DNA fragment which is to be cloned, and (iii) a recombination site; 

(c) ligating the foreign DNA to the product of step (b) without generating 
concatemers; 

1 0 (d) reacting the product of step (c) with pac cleavage proficient extract 

and head-tail proficient extract wherein the ratio of large heads to small heads in 
the head-tail extract is at least 5:1; 

(e) infecting a bacterial strain expressing a variant recombinase with the 
product of step (d); and 

1 5 (f) recovering the cloned DNA. 

The term pac is a generic name which refers to the site needed to initiate 
packaging of DNA. The pac cleavage proficient extract contains the recognition 
proteins necessary to cleave the pac site and, thus, initiate packaging. The head- 
tail proficient extract contains the heads and tails needed to package the cloned 

20 DNA into a virus particle. The term concatemer means a DNA molecule 

consisting of repeating units arranged in a head-to-tail configuration. The term 
stuffer fragment refers to a DNA fragment which is inserted into the vector 
DNA at a unique site, and within which headful packaging is terminated. The 
terms bacteriophage and phage are used interchangeably herein. 

25 Although many of the elements described herein pertain to the PI 

bacteriophage cloning system, those skilled in the art will appreciate that, with 
the exception of the components needed to package DNA (pac and packaging 
extracts), many of the elements discussed below, such as plasmid replicon and a 
multicopy or lytic replicon, pertain to the recovery of packaged DNA and can be 
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used to recover DNA in bacteria, such as E. colU with other cloning systems, for 
example, bacteriophage, yeast, etc. 

Bacteriophages which are suitable to practice the invention must have a 
large head capacity and the elements necessary for packaging DNA must be 

5 defined. For example, for phages P22 and Tl, which utilize headful packaging, 
the necessary packaging elements are defined. However, P22 and Tl do not 
have a very large head capacity. On the other hand, for phage T4, which has a 
large head capacity, the necessary packaging elements have not been defined. 
The elements necessary for packaging DNA (i.e., an in vitro headful 

10 packaging system) are the following: 

(1) a unique site, pac, which is cleaved by recognition proteins; it is the 
pac cleavage proficient extract which contains the recognition proteins 
necessary to cleave the pac site; and 

(2) empty phage heads into which the DNA is packaged until the head 

1 5 has been completely filled, then a cleavage event is triggered (the "headful" cut) 
which separates the packaged DNA away fi-om the remaining components; it is 
the head-tail proficient extract which contains the heads and tails needed to 
package the cloned DNA into virus particle. 

Although initiation of packaging is site-specific (cleavage of pac site 

20 initiates packaging), termination of packaging is not site-specific. In other 
words, no unique site is recognized, as packaging will terminate at whatever 
point the head has been filled. 

In the case of PI, the DNA substrate used in the packaging reaction 
during the viral life cycle is a concatemer consisting of individual units of the PI 

25 chromosome arranged in a head-to-tail manner. Headful packaging, using either 
PI phage or any other phage, is a four step process: (1) In the first step a unique 
site, pac, is recognized and cleaved by the pac recognition proteins (PRPs); (2) 
DNA on one side of the cleavage is packaged into an empty phage head until the 
head has been completely filled; (3) a second cleavage event is then triggered 

30 (the "headful" cut) that separates the packaged DNA away from the rest of the 
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concatemer; and (4) initiation of a second round of DNA packaging from the 
free end generated by the previous ''headfal" cut-hence the term processive 
headful cutting. However, if a concatemer is not generated then processive 
headful packaging does not occur. 
5 The ends of the packaged PI DNA do not contain complementary single- 

stranded sequences, as do the ends of packaged bacteriophage lambda DNA, and 
consequently after PI DNA is injected into a bacterium its cyclization does not 
occur by strand annealing but rather by recombination between homologous 
sequences present at the ends of the molecule. Because of this circumstance, 

10 any vector that uses PI packaging, or for that matter any headful packaging 
mechanism, must devise a means of cyclizing the linear packaged DNA by 
recombination. Cyclizing is accomplished by incorporating recombination sites 
into the vector and using a disclosed variant recombinase to cyclize the DNA 
after injection into gram-negative bacterial strains expressing the variant 

1 5 recombinase. 

PI produces two head sizes, a big head that can accommodate 105-1 10 
kb of DNA, and a small head that can accommodate no more than 45 kb of 
DNA. Normally the ratio of big to small heads in a PI wild-type infection is 
10:1, however, in the cm-2 mutant of PI used to prepare some of the packaging 

20 lysates described herein, the radio of head sizes is 1 :1 . The head-tail packaging 
lysate prepared from the cm mutant of PI contained the usual ratio of big to 
small heads which is about 10:1. This is the preferred lysate for preparing head- 
tail packaging extract. To ensure packaging of DNA exclusively into the big 
phage heads, the DNA must be bigger than that which can be accommodated by 

25 the small heads. It is generally desired that there be a large excess of big heads. 
However, the ratio of large heads to small heads should not fall below a ratio of 
about 5:1. 

Illustration 

The following illustration describes an example of how the disclosed 
30 method can be used to generate variant FLP recombinases with altered site 
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specificity. As with other recombinase, the method preferably uses the 
following components: 

1 . An in vitro mutagenesis system; 

2. A recombinase expression plasmid that allows varying levels of 

5 expression by a simple environmental control (for example, by the presence of 
varying amounts of an inducer substance in the growth media, by temperature, 
or by osmolarity); 

3. An indicator/selector bacterial strain. The strain carries both an 
indicator recombination substrate for detection of recombination at the wild type 

10 recombination site and a second recombination substrate that allows selection 
for recombinase mutants that have gained the ability to recognize and perform 
recombination at a target mutant recombination site (that is, a variant 
recombination site). Importantly, the wild type and target mutant sites are 
designed so that recombination between the mutant and wild type sites is 

1 5 blocked even with a mutant recombinase that can recognize both the wildtype 
site and also the target mutant site. This design prevents unwanted 
recombination between the wildtype and target mutant recombination sites that 
could interfere with either selection or detection of desired recombinational 
outcomes. The block is imposed by designing the wildtype and mutant sites to 

20 have different spacer regions (that is, different compatibility sequences), for 
example, the normal "wt" spacer for the wildtype recombination site, and an 
alternative spacer "Al" for the other recombination substrate. In an otherwise 
nonmutant recombination site DNA recombination proceeds efficiently both for 
recombination sites having the wt spacer (that is, a recombination between two 

25 wt sites) and also for sites having the Al spacer (that is, a recombination 

between two Al sites). Yet, recombination between the Al site and the wt site 
is blocked (that is, recombination between a wt site and an Al does not occur). 
This strategy is applicable to all recombinases that have a recombination target 
site displaying one or more recombinase binding sites (repeat elements) on each 

30 side of a spacer region in which recombination occurs (Nxmes-Doby et al., Nucl 
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Acids Res., 26:391-406 (1998)). Such sites display a requirement for homology 
in the spacer elements for optimal recombination activity and has been shown to 
be the case for members of the Int family of recombinases, including Cre, 
lambda Int, and FLP (Craig, Ann. Rev. Genet. 22:77-105 (1988)). 

5 The preferred in vitro mutagenesis system is that of Stemmer (Stemmer, 

Nature 370:389-391 (1994)), or a variant of that strategy. After mutagenesis 
and assembly of fragments into a full-length FLP gene, it is cloned into the 
expression vector. 

The expression plasmid to be used can be any of the 'Inducible" 

10 expression plasmids available in bacteria. For this illustration one of the pBAD 
plasmids for E. coli was chosen that allows expression of recombinase by 
growth on arabinose (Guzman et al, X BacterioL 177:4121-4130 (1995)), and 
which can be turned off by growth on glucose (and no arabinose). For this 
illustration the expression plasmid carries the replication origin of pACYC , and 

1 5 the FLP recombinase gene is under the control of the E, coli ara promoter 

region. In addition, the plasmid carries the selectable marker Cm' which confers 
resistance to the antibiotic chloramphenicol. Because the pACYC replicon is 
low copy, its use may be advantageous in preventing excessive expression of 
FLP. Alternatively, a higher copy replicon could be used, such as that of ColEl . 

20 In that case the expression level of FLP must also be carefully controlled using 
the inducer substance arabinose. 

The indicator/selector bacteria carries two different reporter constructs 
for FLP-mediated recombination. The first reporter construct consists of two 
FRT sites (FLP recombination target; that is, the recombination site recognized 

25 by FLP recombinase) in direct orientation (an excision substrate) and resides on 
a low copy replicon that is compatible with the FLP expression construct. In 
this example the first substrate is integrated into the E, coli genome. This can be 
done by incorporating the FRT substrate onto phage lambda and then 
constructing a lambda lysogen. Alternatively the FRT substrate could reside on 

30 a low-copy replicon that is compatible with that of the FLP expression vector 
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and which has an additional selectable marker, for example resistance to 
bleomycin. This FRT substrate carries two FRT sites in direct orientation 
flanking a gene whose presence can be easily monitored. In this illustration a 
constitutively expressing lacZ gene was used whose presence can be determined 

5 simply by growing colonies plates containing X-gal, upon which they will 
become blue in color. Loss of the lacZ gene by FLP-mediated recombination 
results in white colony formation on X-gal plates. 

The indicator/selector bacterial strain also carries a second FRT-like 
substrate. This is a plasmid element having two FRT-M sites in direct 

1 0 orientation flanking DNA sequences (STOP) that disallow expression of a 

downstream selectable marker. In this example, nonexpression of the selectable 
marker is achieved by placing genetic elements in the following order: 
constitutive promoter - FRT-M - STOP - FRT-M - 'neo, where 'neo indicates 
the promoterless neo gene of Tn5. Hence this cassette cannot express neo and 

1 5 cells are sensitive to the antibiotic kanamycin. Excision of STOP by 

recombination at FRT-M is designed to permit expression of neo so that cells 
now become resistant to kanamycin. The plasmid carries an additional 
selectable marker, Ap' conferring resistance to ampicillin, to maintain presence 
of the plasmid in E, colL The STOP sequence here is the strong transcriptional 

20 terminator rmBTlT2 (Liebke et al., Nucleic Acids Res, 13:5515-5525 (1985)). 
Figure 1 7 shows wt FRT, FRT-Al , and FRT-M sites used in this 
illustration. Although the wildtype FRT site displays three inverted repeat 
elements, recombination proceeds efficiently with sites carrying two of these 
repeats in the inverted configuration shown (Jayaram, Proc, Natl Acad Sci. 

25 USA 82:5875-5879 (1985)). Either the full or minimal site can be used since 

both are recombinationally functional. The FRT-Al site is designed to have an 
altered spacer but which is still functional for self X Sfelf recombination 
(Senecoff et al., I Biol Chem, 261 :7380-7386 (1986)). The target FRT-M site 
is designed to carry symmetrical mutations in the repeat elements that disallow 
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efficient FLP-mediated recombination (Senecoff et al, 1 Mol Biol 201:405- 
421 (1988)), and also the spacer mutation of FRT-AL 

Importantly, the FRT-M site differs from the FRT site in two ways. 
First, both of the 13 bp inverted repeat elements (that is the recognition 

5 sequences) are mutated in a symmetrical manner such that the wt FLP enzyme 
does not catalyze recombination between two FRT-M sites, or does so only 
extremely poorly (< 0.1%). Second, the spacer region (that is, the compatibility 
sequence; the 8 bp region between the 13 bp inverted^repeats) is replaced with 
an ahernate spacer AL The alternate spacer when present in an otherwise wt 

10 FRT site, which we will call FRT-Al, is permissive for FLP-mediated 

recombination between two FRT-Al sites, but does not permit recombination 
between FRT-Al and FRT. Use of the FRT-M site which contains the 
heterologous spacer prevents FLP-mediated recombination between FRT and 
FRT-M by FLP or a mutant FLP protein that might otherwise catalyze 

1 5 recombination between the wt FRT substrate and the mutant target FRT-M 
substrate. Unwanted recombination between the wildtype and target mutant 
recombination sites would decrease efficiency of the selection procedure by (a) 
not limiting recombination at the target mutant site specifically to these sites and 
thus compromising the selection (at FRT-M sites) for mutant FLP recombinases, 

20 (b) affecting the accuracy of the specific indication of activity at the wt FRT 
sites, and (c) decreasing either the plasmid stability of the FRT-M selector 
substrate or the integrity of bacterial chromosome (or compatible plasmid) 
carrying the wt FRT sites. 

Procedure (Figure 18): the FLP gene is mutagenized in vitro and then 

25 cloned into the inducible expression vector, in this case a pBAD derivative that 
places FLP under the control of the arabinose-inducible pB AD promoter. The 
pool of mutagenized FLP genes is transformed into the FRT indicator/selector 
strain which is pre-induced with arabinose and/or induced with arabinose during 
DNA transformation. Bacterial colonies are then selected to be simultaneously 

30 resistant to chloramphenicol (to retain the FLP expression plasmid), ampicillin 
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or carbenicillin (to retain the selector plasmid) and kanamycin (to select for cells 
in which FRT-M X FRT-M recombination has occurred) on agar plates 
containing either arabinose (for continued FLP expression) or glucose (to 
prevent prolonged FLP expression). In some instances it may be advantageous 

5 to limit FLP expression to better enrich for those FLP mutants that either have 
more avidity for recombination at the FRT-M sites or to better exclude those 
FLP mutants that retain activity at the wildtype FRT sites. This is because 
prolonged or high-level FRT expression can lead to inefficient but detectable 
recombination at mutant sites, 

1 0 Either all Kan' colonies or only those that are blue on X-gal plates are 

then pooled and harvested for DNA preparation. A second round of FRT gene 
mutagenesis and selection is then initiated by PGR amplification. Multiple 
rounds of mutagenesis and selection are used to obtain FRT mutants with altered 
site-specificity. Comparison of various individual isolates allows determination 

1 5 of critical amino acid residues that contribute to the d€sired mutant phenotype. 

The same rationale and procedure can be used to generate a second class 
of altered FLP recombinases. The target mutant FRT site used is, however, 
different. In this case the target is the FRT-M2 site (Figure 19) which carries a 
different binding site mutation(s) than does the FRT-M site as described above. 

20 Examples 

Example 1: Selection of Variant Cre Recombinases 

The following example describes the production and analysis of some 
examples of the disclosed variant recombinases. Cre mutants characterized by a 
wider substrate recognition were created, applying a technique called directed 

25 molecular evolution: Multiple rounds of a random mutagenesis procedure 
(DNA shuffling; Stemmer, W, P. Proc, Natl Acad, Set USA. 91:10747- 
10751 (1994)) and a sensitive selection for the desired phenotypes allow to 
accumulate candidate mutants within the generated pools of mutated sequences. 
The Cre mutants created in this example showed wt-like activity on loxV sites. 

30 In addition, they performed on an altered substrate, called /oxK2, that is no 
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recognized by the wt enzyme. Two transversions from adenine (/oxP) to 
thymine (/oxK2) at positions 1 1' and 12' of the lox sequence are the barriers that 
inhibit wt Cre from recognizing /oxK2: The two thymines are beUeved to cause 
repulsive forces with the acidic side chain of a glutamate residue in the J helix of 

5 wt Cre (position 262). This glumtamate was found to be replaced by a glycine 
in all mutants with remarkably increased activity on loxK2. Additional site- 
directed mutagenesis experiments, confined to the glutamate at position 262 of 
Cre, could confirm that E262G but also E262W mutations alone are sufficient to 
increase /oxK2 activity by a factor of 10^ without affecting loxV recognition. 

1 0 Other point mutations identified in the analyzed mutants may however be 

responsible for increasing the newly obtained specificity even further (10 fold 
compared to E262G alone). 

MATERIALS AND METHODS 
General Procedures 

1 5 Standard Reagents 

The following reagents were used in all experiments: lOx' TBE (Tris- 
Borate-EDTA, pH 8.3) was purchased from Biofluids, Inc. (Rockville, MD) and 
diluted to Ix with deionized water prior to use. TE (Tris-HCl 10 mM, EDTA 1 
mM, pH 8.0 and pH 7.5) and Tris-HCI (1 M, pH 7.5 and 8.0) came from Quality 

20 Biological, Inc. (Gaithersburg, MD), as well as autoclaved LB (Luria-Bertani) 
and SOC broth. L-(+)-arabinose (>99%) was ordered from Sigma- Aldrich Fine 
Chemicals (St. Louis, MO) and anyhdrous D-glucoseJrom Mallinckrodt 
Laboratory Chemicals (Phillipsburg, NJ). 
Gel Electrophoresis 

25 For DNA electrophoresis, 0.8% agarose TBE gels were used (GTG Sea 

Kem Agarose (FMC, Rockland, ME)). Gels were prestained with 0.25 |ig/ml 
EtBr (Ethidium Bromide, 10 mg/ml (Life Technologies, Inc., Grand Island, 
NY)). The used electrophoresis apparatus was a DNA SUB CELL™ (BioRad, 
Hercules, CA) with an OSP 105 (OWL, Wobum, MA) powersupply. Gels were 

30 run at 60 V (5 V/cm) as recommended by Sambrook et al.. Cold Spring Harbor, 
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New York: Cold Spring Harbor Laboratory Press (Second Edition) (1989). 
Occasionally, for small amounts of samples, 50 ml minigels were used under 
similar conditions (Hoefer HE33, Hoefer Scientific Instruments, San Francisco, 
CA). Molecular weight standards were X / Hind III digest (Research Genetics, 
5 Huntsville, AL) and Ready-Load™ 1 00 bp DNA ladder (Life Technologies), 
providing a standard size range from 100 bp to 23 130 bp (Figure 4). For 
standard fragment purification from gel, the Geneclean II® Kit (BIO 1 0 1 , Inc . , 
La JoUa, CA (1 1/98) was used, following the manufacturer's instructions. 

Minipreps and Plasmids 
10 Plasmids for diagnostics, cloning, and sequence analysis were prepared 

using the Wizard™ Minipreps Plus Kit (Promega, Wizard™ Minipres Plus 
DNA Purification System. Instruction Manual (Madison, WI) (1/96)). Useful 
ones were assigned a pBS number and stored in TE pH 8.0 at +4°C. 

Oligonucleotides 

1 5 All oligonucleotides used as PGR primers, for plasmid construction, or in 

the mutagenesis procedure, were ordered from Midland, Inc. (Midland, TX) in 
gel filtration (GF) quality. The lyophilized oligonucleotides were assigned a 
BSD number, suspended in HPLC grad water (Sigma- Aldrich) at a final 
concentration of 300 ^iM, and stored at -20°C. 

20 DNA Digests and Ligations 

All enzymes used for DNA manipulations (restriction enzymes, T4 
ligase, etc.) were purchased from New England Biolabs, Inc., Catalog. 
(Beverly, MA) (1998/99) and used as recommended in the manufacturer's 
catalog (1998/99). Briefly, for restriction enzyme digests the total reaction 

25 volume was 20 p,l with approximately 10 units (U) of enzyme. For DNA 
fragment ligations, 10 |al with 200 U of T4 DNA Ugase were used. 
E. coli Strains 

All E. coli strains, except otherwise mentioned, were derived originally 
from DH5a: endh.1 hsdRll (rKniK^) supE44 thi-\ recAl gyrA (Nal^) re/Al 
30 A(/acIZYA-iirgF) Ul 69 deoK (M80 dlac A(/acZ)Ml 5) (Woodcock et al, Nud. 
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Acids Res,, 17:3469-3478 (1989); Raleigh et al, In Current Protocols in 
Molecular Biology, eds. Ausubel, F.M. et. al (New York: Publishing Associates 
and Wiley Interscience), Unit L4 (1989), After modification with X prophages 
or plasmids, strains were catalogued by assigning them a BS number and stored 
5 at -80°C with 10% DMSO (Dimethylsulfoxide) after overnight culture in 
appropriate selection medium. 

Transformation of E. coli 

For all plasmid transft)rmations ofE. coli strains, electroporation was 
preferred over chemical protocols. Electorcompetent cells were made and used 

10 for electroporation as described by Smith et al, Focu§, 12:38-40 (1990). The 
appropriate cell porator and cuvettes were from Life Technologies. Depending 
on the selection procedure after electroporation, the time in SOC medium (Smith 
et al, Focus, 12:38-40 (1990) at 37^C under agitation (Lab-Line® Orbit 
Environ-Shaker, Lab Line Instruments, Inc., Melrose Park, IL) prior to plating 

1 5 on selection medium was 1 h for ampicillin (Ap) and 2 h or more h for 

kanamycin (Kan) selection. For induction of ere expression (as described 
below), the transformants were cultivated in SOB (Smith et al, Focus, 12:38-40 
(1990)) supplemented with 0.2% of L-(+)-arabinose (Sigma-Aldrich) plus 
20mM of MgCli (referred to as induction medium) for 2.5 h and 4 h before 

20 plating on the appropriate selection media (see below). Resulting colony 
numbers were counted after overnight incubation at 37''C in a gravity 
convection incubator (Precision Scientific, Chicago, IL). 
E, coli Cultures 

LB (as mentioned above) was used as the standard medixim for all E. coli 
25 cultures (liquid or solid). For selection and screening, the appropriate reagents 
at the following concentrations were added: 
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Table 1: List of reagents used for selection and screening ofE. coli 



cultures. 
Reagent 



Concentration 



Stock Solution 



Ampicillin (Ap) 1 00 |ag/ml 

Chloramphenicol (Cm) 27 \i%lm\ 

Kanamycin (Kan) 16 |ag/ml 

X-gal 0.003% 

Z-ara 0.006% 



50 mg/ml in H2O 
34 mg/ml in EtOH 
10 mg/ml in H2O 
2% in DMF (w/v) 
2% in DMF (w/v) 



The concentration of stock solutions, stored at -20°C and their dilutions 

10 in liquid LB medium or LB-agar plates is given. X-gal stands for 5-bromo-4- 
chloro-3-indolyl-[3-D-galactopyranoside, and Z-ara for 5-bromo-3-indolyl-a-L- 
arabinofuranoside. All reagents, except Z-ara, were purchased from Life 
Technologies and aliquoted in the desired stock concentration for storage, Z-ara 
(Berlin and Sauer, Anal Biochem,, 243:171-175 (1996)) was a generous gift 

15 from W. Berlin. 

Ready-to-use soUd LB-agar plates (2%) plain, or supplemented with Ap 
(100 |ig/ml) were purchased from Digene, Inc. (Beltsville, MD). For all other 
reagent combinations in solid medium, plates were poured according to the 
needs using autoclaved 2% LB agar purchased from Biofluids. 

20 Polymerase Chain Reaction (PCR) 

Standard PCR reactions were carried out in 50 |Lil total volume with the 
following reagents (all, except noted, from Perkin Elmer, Foster City, CA): Ix 
PE buffer II (without MgCl2), 2 mM MgCh, 250 of each dNTP, 0.8 jiM of 
each primer, ca. 50 ng of template DNA, qsp. H2O (HPLC grade, Sigma- 

25 Aldrich) to 49.5 |al. For mutagenic PCR reactions (also referred to as error- 
prone PCR), the amount of each dNTP was reduced to 20 |uM and 0.25 mM of 
MnCla added. After denaturation at 95°C for 5 min, 0.5 [d of 5 U/|al PE Amlpi 
Taq Polymerase was added at approximately the annealing temperature of the 
primers. After mixing, the appropriate thermal cycles were carried out (as 

30 indicated individually below), using a PTC 200 thermal cycler (DNA Engine, 
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MJ Research, Cambridge, MA). When finished, all PGR products immediately 
were loaded on an agarose gel, or separated from enzyme, nucleotides and 
primers by applying the Wizard™ PGR Preps Kit (Promega, Wizard'^^ Minipres 
Plus DNA Purification System. Instruction Manual (Madison, WI) (1/96)). 
5 PGR products were recovered using deionized water and stored frozen at -20''C. 
Sequence Analysis 

Sequence analysis of plasmid constructions and ere mutants were carried 
out on a PE ABI PRISM^^ 310 Genetic Analyzer (Perkin Elmer) according to 
recommendations in the manufacturer's protocol P/N 402078 Revision A (1995) 

1 0 for the ABI PRISM'^^ Dye Terminator Cycle Sequencing Kit (Perkin Elmer). 
Briefly, a cycle sequencing reaction contained ca. 50 ng of template DNA in 
miniprep quality, 4 pmol of primer, and 8 |al of the ABI Terminator Ready 
Reaction Mix (Perkin Elmer), in a total volume of 20 |al, and subjected to the 
following conditions: (96^G, 10 s; melting temperature of primer, 15 s; 60°G., 4 

1 5 min) 26 times on a PTC 200 thermal cycler (MJ Research). After removal of 
residual primers and dye by ethanol precipitation, the DNA was resuspended 
25 |al of ABI Template Suppression Reagent (Perkin Elmer) and denatured at 
95°G for 5 min before loading the ABI Genetic Analyzer. The obtained data 
files were examined using ABI PRISM™ Sequencing Analysis Versions. 0 

20 Software (1996, Perkin Elmer), Gene Jockey II (1996, Biosoft, Cambridge, UK) 
software was used for sequence comparison, translation, and alignments. 
Mutagenesis Procedure 
Substrate Preparation by PGR 

The ere gene for the following DNase I shuffling reaction was amplified 

25 by PGR using 5' forward primer BSB436 (5' 

AAATAA TGTAGA CTGAGTGTGAAATGTGG 3') and 3' reverse primer 
BSB376 (5' ATATAT AAGGTT ATGATTTAGGGGTTAATGG 3'), 
introducing rnXba I and Hind III cloning site, respectively (underlined). 
Mutagenic and non-mutagenic PGR's were carried out: (94°G, 30 s; 52''G, 30 s; 

30 72°C, 90 s) 45 or 30 times, respectively. The 5' primer was designed to include 
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the endogenous Shine-Dalgarno (SD) of ere, whereas its three endogenous 
promoters were excluded (position -17, 6; positions refer to the adenine of the 
start codon of the ere coding sequence as position 1). Thus, after introducing 
the resulting ere genes into pBAD33 (see below), expression was exclusively 

5 under control of the pBAD promoter without interference or background 

expression due to endogenous promoters. Including the SD sequence of cr^ was 
necessary, since pBAD33 does not contain this sequence 5' of its multiple- 
cloning-site (MCS). The 3' reverse primer was designed to be homologous to 
the 3' untranslated region (UTR) of ere (position 1057, 1032). Mutagenic 

1 0 events were therefore permitted in 1 020 bp of the entire 1 026 bp ere coding 
sequence, excluding the first two codons. For the first round of the directed 
evolution procedure, the wt ere expression plasmid pBS185 (Sauer and 
Henderson, The New Biologist, 2:441-449 (1990)) served as template. In 
following cycles, the pool of mutated ere genes from the previous round was 

15 used. In all experiments, both, mutagenic and non-mutagenic PCR's were 
carried out in parallel using the appropriate template. 
Homologous Recombination in Vitro 
DNase 1 digest 

Approximately 5 |ag of the ere PGR product (ca. LI kb) wee digested 
20 with 0.03 U of DNase I, type IV (Sigma-Aldrich) in 20 jliI total volume of 50 
mM Tris-HCI pH 7.5 plus 1 mM MgCb for 2 to 3 minutes at room temperature. 
After digestion, samples immediately were loaded on a 2% minigel to separate 
the generated fragments (Figure 5). Fragments of 25 bp to 300 bp were purified 
from the gel by DE81 (Whatman, Maidstone, GB) extraction and ethanol 
25 precipitation (Sambrook et ah. Cold Spring Harbor, New York: Cold Spring 

Harbor Laboratory Press (Second Edition) (1989)), before suspending in 5 jal of 
TEpHS.O. 

Self-Printing PGR 

A 60 cycle non-mutagenic PGR (as described above) was carried out 
30 without added primers, allowing the fragments to prime themselves and thereby 
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to undergo shuffling while reassembling. Conditions for PGR were: 94''C, 90 s; 
(94^C, 30 s; 30 s; 90 s) 60 times; 10 min. 
Reassembling of ere 

Since the self-priming step never yielded a single size product but rather 
5 a range of fragments between 300 bp to 2000 bp (Figure 5), the self-priming 
PGR mixture was diluted 1/40 in a non-mutagenic PGR mix with primers 
BSB376 and BSB436 (see above), and subjected to an additional 20 cycles 
(94°C, 30 s; 52''G, 30 s; 72''C, 90 s). This additional step lead to one product of 
1.1 kb size (Figure 5). 

10 ere Expression 

After digesting the linkers of the generated mutant ere pool ^Nii\i Xba I 
and Hind III, the fragments were ligated into the identical sites of the MGS of 
the ere expression vector pBAD33 (Figure 6). Two features favored the choice 
of pBAD33 as the vector to express the mutant ere pool for the selection 

1 5 procedure (see below) : First, its p AC YC 1 84 derived origin of replication is 
compatible with the ColEl derived ones of the plasmids used in the selection 
procedure. Second, pBAD33 contains the promoter of the arabiniose operon 
(pBAD), as well as expresses the regulatory protein AraC. It is therefore 
possible to regulate the expression of a gene cloned into the MGS and under 

20 PBAD control, from moderately high levels to nearly complete regression, by 
simply changing 0.2% L-(+)-arabinose in the medium to 0.2% D-glucose 
(Guzman et al., J. Bacteriol, 177:4121-4130 (1995) and Miyada et al., Proe. 
Natl Acad. Sci, USA, 81:4120-4124 (1984). As indicated above, the primers for 
ere PGR were designed in order to include the endogenous SD sequence but to 

25 exclude the three ere promoters, ere expression therefore will be under 

complete control of the pBAD promoter. This is important for the selection 
procedure (see below) that was intended for few Cre molecules acting on 
different lox sites. High concentrations or long term background expression of 
cre could eventually defeat the selection since wt Cre also catalyzes at very low 

30 frequencies recombination events between altered lox sites. 
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Plasmids and E. coli Strains Used for Selection and Screen 

Mutant lox sites 

Figure 1 compares the original loxP site to the two mutant sites, /oxKl 
and loxKl, used during the described experiments. The lox sites with 5' Sal I 
5 and Xho I compatible, and 3' Xba I and Nhe I compatible ends were received as 
single stranded oligonucleotides from Midland and annealed by heating the 
appropriate ones together at 70°C, followed by a gradual cool down. 

Plasmids for Selection and Screening 

Plasmid pBS561 was constructed using three fragments: (i) the 5' 

10 modified neo gene derived from pBS398 (Sauer et al, Methods, 4: 143-149 

(1992), (ii) the RSVneo (Gorman et aL, Science, 221:551-553 (1983) backbone 
without the neo gene, and (Hi) the oligonucleotide-derived MCS (Figure 7). The 
EGFP gene derived from pEGFP-Nl (Clontech, Palo Alto, CA) was then 
inserted into the MCS along with 5' and 3' lox sites orientated in the same 

15 direction to produce plasmids pBS568 (loxKf) and pBS569 (/oxK2^). Figure 8 
summarizes the procedures used to construct plasmids pBS583 and pBS584. To 
restore the original neo reading frame without the 5' extension, the loxK 
cassettes containing MIu VKpn I-fragments from pBS568 and pBS569 were 
ligated into the RSV neo backbone that contains Mu I and Bgl II sites. The Bgl 

20 II - Kpn I junction was achieved by filling the 3' recessed end of Bgl II with 

Klenow (NEB) followed by a blunt-end ligation to the Kpn I end. This junction 
also was checked by sequencing and found to be correct. A transcriptional 
terminator, rrnBTiT2, derived from pBAD33 by non-mutagenic PGR with 
primers BSB425 (5' ATAAGCGGCCGCTGAGCTTGGCTGTT TTGGCGG 

25 3 ') and BSB426 (5' GCCGT CTCGAG AGAGTTTGTAG 

AAACGCAAAAAGGC 3'), was inserted into the loxK^ cassette 3' of the 
EGFP gene after digest of the its Not I and Xho I linkers (underlined). With this 
construct, it could be predicted that a catalyzed recombination by Kl"*" or Kl^ 
Cre mutants between the lox sites would result in the excision of EGFP and the 

30 transcriptional terminator, and thereby permit the transcription of the neo gene 
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due to the RSV promoter, located 5'. Expression of neo would no longer be 
impaired, because it is placed under control of 5' promoter elements present in 
RSVneo (a Kan^ rendering plasmid in E. coli), 

A similar lox?^ cassette selection plasmid also was designed (pBS613, 
5 Figure 9), to be used as a control. Using this plasmid, the frequency of loxF 
recombination by Cre mutants could be determined in the same manner as used 
to evaluate loxKl or /oxK2 recombination by pBS583 and pBS584. 

Finally, a completely different set of /oxKl^/K2^ cassette plasmids was 
created. These plasmids were no longer used for selecting mutants that 

1 0 recognize /oxKl or /oxK2, but rather were used to screen for these mutants in 
conjunction with a different bacterial background (see below). Figure 10 
summarizes the construction process for pBS601 (/oxKl^) and pBS602 
(/oxK2^): In a first step, the neo resistance marker of pBS581 and pBS582 
(intermediates in the construction of pBS583 and pBS584) was removed by 

15 deleting the Pvu II fragment and thus restoring the possibility to use neo for a 
different selection procedure. Following this, the EGFP gene between the lox 
sites was replaced by the pUC19 (Yanish-Perron, et. al, 1985) derived lac 
promoter with its 3' MCS. This pUC19 fragment was obtained by non- 
mutagenic PGR with primers BSB448 (5' GTCAA GCTAGC TAGCAGGTTT 

20 CCCGACTGG 3') and BSB449 (5' ACAT TGCGGCCGCAGATCT CCTCTA 
GAGTCGACCTG 3 '). An Nhe I site 5\mdaBgl II and a Not I site 3 ' 
(underlined) were introduced thereby, which made it possible to replace the Nhe 
I - Not I EGFP fragment after linker digestion. The newly generated polylinker 
between the two lox sites permitted insertion of the Xba I - BamR I fragment of 

25 pBS481 (that carries the abf A,st marker gene) into its Xba I and Bgl II sites. As 
shown recently, E, coli strains expressing a recombinant a-L- 
arabinofuranosidase gene from Streptomyces lividans* (abf A,st), can be detected 
by eye on LB plates containing 5-bromo-3-indolyl-a-L-arabinofuranoside (Z- 
ara) Berlin and Sauer, Anal Biochem,, 243:171-175 (1996). This leads to the 

30 formation of an indigo blue pigment, that is similar to the classical lacZ/5- 
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bromo-4-chloro-3-indolyl-p-D-galactopyranoside (X-gal) marker system. It 
could therefore be expected that E. coli strains expressing mutant Cre 
recombinases which allow recombination of loxYA or /oxK2 should lose the 
abf A,st gene and should form white colonies on Z-ara plates. All other clones, 
5 however, should be dark blue. Using abf A.st instead of the well established 
lacZ screen was necessary, because the E. coli strain (see below) used for this 
construct expresses p-galactosidase endogenously. 
E, coli Strains for Selection and Screen 

The E. coli strain BS583, DH5 Alac (^D69 lox?\lacZ LEU2]), was 

1 0 chosen as the bacterial background for the selection procedure for Kl"^ or K2"^ 
Cre mutants by plasmids pBS583 or pBS584. Due to the lox?\lacZ] containing 
X prophage, Cre activity on lox? can be evaluated simply by using X-gal plates. 
The selection strains BS1493 and BS1494 were made by introducing the 
selection plasmids pBS583 and pBS584 into BS583 (Table 2). The lox?2 

15 plasmid pBS613, to be used as a control, needed to be in BS583 cells as well, 
becoming the strain BS1541 (Table 2). 

For the screening plasmids pBS601 and pBS602, E. coli strain NS2300 
(Sternberg et ah, I Mol Biol, 187:197-212 (1986)) was selected as host: K12 
rec A:\Tn\Q Qd^^^ lox^llneo]). This strategy combines a kanamycin selection 

20 for Cre enzymes, that are no longer active on loxV (P"), with a screen for Kr or 
K2'' enzymes. By transforming pBS601 and pBS602nnto NS2300, the P' 
selection strains BS1523 and BS1524 were formed (Table 2). 
Selection and Screen for CRE Mutants 
Selection for Kl'^/K2^ and Screen for F 

25 After ligation of the generated mutant cre pool into pBAD33 for 3 h, 

BS1493 or BS1494 electrocompetent cells were transformed with 2 |ul of the 
microdialyzed reaction mixture (VS membrane, MilliporeTM- Bedford, MA). To 
induce expression of the cre pool, the transformed cells were incubated at 37''C 
in induction medium for 2.5 h and/or 4 h under agitation (as mentioned before). 

30 Cultures were diluted 1/500 or 1/5000 and grown on LB plates with the 
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following formulation: Ap, Cm, glucose, and X-gal for determining the 
transformation efficiency, referred to as non-selection plates. Dilutions of 1/5 
and occasionally 1/50 were grown on plates with addition of Kan, used to select 
for Kl^ or K2'" mutants and called selection-plates. The formulation of the 
plates served the following purposes: (i) Ap and Cm were added to assure that 
all clones contained both, selection and expression plasmid, (ii) X-gal to 
distinguish between P"" and P' clones (Table 2). After overnight incubation at 
ST'^C, blue and white colonies were counted and pools prepared for the next 
round of DNA shuffling. Alternatively, certain mutants were chosen for further 
analysis (see below). 

Selection for P' and Screen for Kl^ or K2^ 

After 2.5 h and/or 4 h of expression of the mutant ere pool in the 
transformed BS1523 and BS1524 cells, usually dilutions of 10'^ or lO'"^ were 
grown on LB agar plates supplemented with the same reagents as listed above, 
except, that Z-ara replaced X-gal to allow the Kl'^/Kl' screen. Non-selection 
plates were used for determining the transformation efficiency, and Kan 
containing plates for the P" selection (Table 2). 

Mutant Analysis 

wt ere Expression Plasmid 

With fewer cycles (15) of non-mutagenic PCR on the ere expression 
plasmid pBS185 and after linker digestion, the ere pool obtained was cloned 
into pBAD33 and transformed into BS583 cells. After 2 h of ere expression, the 
transformed cells were grown on X-gal plates. After overnight incubation at 
37''C, two white colonies (indicating lox? recombination) were picked for 
plasmid preparation and complete sequencing. No point mutation was found in 
either one, so that each could be used as a control plasmid for wt Cre expression. 
One of the two was selected for further use and named pBS606. 

Functional Testing 

In order to determine the frequency of lox recombination of isolated 
mutant Cre enzymes by the described selection procedure, it is necessary to 
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separate the ere candidate expression plasmid (pBAD33) from the selection 
plasmid of the chosen Kan^ candidate. Then, the cleaned expression vector can 
be used to retransform the appropriate selection and screening strain BS1494, as 
well as BS1493 and BS1541 to determine the candidate's capacity for loxK2, 
loxKl and lox? recombination under identical conditions. By comparing the 
resulting frequencies of Kan^ of different Cre mutants and wt Cre, all treated 
identically, one can determine quantitatively how well each chosen mutant 
candidate really performs on the altered lox sites. 

Therefore, overnight cultures of candidates were grown in liquid LB 
supplemented with Cm and Kan for plasmid minipreps yielding a mixture of 
both the mutant Cre expressing plasmid pBAD33 and the newly Kan^ selection 
plasmid. In order to eliminate the latter, minipreps were digested with the 
restriction enzyme ^a;^ II which only cuts the selection plasmid. After 
transformation of BS583 cells with this digestion mixture and approximately 2 h 
cre expression, different dilutions were grown on LB agar plates supplemented 
with Cm and X-gal to select for pBAD33. Plates with Ap plus Cm were used to 
determine the background of contamination with uncut selection plasmid. The 
next day, clones obtained by the Cm selection were tested for Ap' and Kan' to 
confirm the elimination of the selection plasmid. A final overnight culture, 
followed by a miniprep procedure, yielded the unique plasmid for functional 
testing, as described above. 

Sequencing 

To obtain the DNA sequence of candidate cre genes in pBAD33, eight 
primers (BSB454 to BSB461, Table 3), four for each strand, were designed so 
that the entire gene could be sequenced in both directions. 

Site-Directed Mutagenesis 

After identifying one essential mutation for the decrease in substrate 
specificity, the Stratagene QuickChange™ Site-Directed Mutagenesis Kit 
(Stratagene Cloning Systems, La Jolla, CA) was used to create cre mutants with 
mutations at the determined location, only. Using three different mutant primer 



sets (BSB 465 to BSB 470, Table 4), all steps were carried out as detailed in the 
manufacturer's instruction manual, except that electrocompetent BS1494 cells 
were used for transformation and mutant selection, replacing the provided XLl 
blue cells. The made mutant candidates were subjected to functional testing and 
sequencing as detailed before. 

In a different experiment, the DNA shuffling mutagenesis procedure was 
repeated on wt ere by adding one 5' phosphorylated strand of each set of mutant 
oligonucleotides (BSB465, BSB467 and BSB469) to the pool of small 
fragments prior to reassembly. This allowed to incorporate them into the 
resulting ere pool. The desired mutations should consequently be introduced at 
much higher frequency than without the addition of oligonucleotides. 

RESULTS 

Establishment of the Selection Procedure 
Selection Plasmids (pBS583 and pBS584) 

To test the generated plasmids pBS583 and pB584, a recombination 
event between their lox sites was mimicked by digestmg pBS566 and pBS567 
(intermediates of the pBS583/584 construction, containing only the 3' lox site) 
with Sal I an.dXho I, followed by religation. This lead to excision of the EGFP 
gene and terminators. After deletion of EGFP and rmBTiT2, the Kan'^ 
phenotype was observed as anticipated. In addition, the frequency of 
spontaneously occurring Kan'^ clones carrying the original plasmids was 
approximately 10"^. This background is inconsequential, since the 
transformation efficiency of BS583 cells was determined as 10^ per \i,g of 
pBAD33. 

The equivalent /oxP control plasmid pBS6 1 3 was tested directly with the 
wt ere expression plasmid pBS606. After 2.5 h of ere expression, 94% of all 
clones were determined Kan^ and about 6% showed blue color. Without ere 
expression, no Kan'^and no white colonies were observed. This confums that 
the control cell line BS1541 (Table 2) permits the combined P^ selection and P" 
screen. 
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Screening Plasmids pBS601 and pBS602 

Only pBS602 was tested before use by expression of wt Cre and a 
Kl^/P^ Cre mutant (see below). On non-selection medium, wt Cre expression 
resulted in more than 95%, whereas expression of the mutant Cre resulted in less 
5 than 3% of blue colonies. This indicates that excision of the loxK2 flanked 

abf A,st marker by K2"^ Cre is possible. On selection medium, very few colonies 
could be found, since both types of cre have shown activity on lox? before (see 
below). 

Mutagenic vs. Non-Mutagenic PGR 

1 0 The frequency of P' Cre mutants obtained after non-mutagenic and after 

error-prone PGR was determined by the following experiment: After one 
mutagenic or one non-mutagenic PCR on the wt Cre expression plasmid 
C3 pBSl 85, the resulting cre pools were inserted into the expression vector 

il pBAD33 and transformed into BS583 cells. After 2.5 h of arabinose-mediated 

% ■ 15 induction or glucose-mediated repression (by SOC medium) or cre expression, 

Q dilutions were transferred to LB plates with Ap, Cm, glucose and X-gaL The 

results are presented in Table 5: Under glucose repression, exclusively blue 
;^ colonies could be identified (first line in Table 5), indicating that cre expression 

eip is insufficient for lox? recombination and excision oflacZ of BS583. Induction 

20 with L-(+)-arabinose, however, lead to the formation of white colonies at the 
□ presented fi-equencies (second line), indicating that (i) the described control of 

cre expression by pBAD33 is functioning, and (ii) the mutagenic PCR 
conditions cause three times more impaired Cre enzymes for lox? recombination 
than the non-mutagenic conditions (60%) blue colonies vs. 30%). ft is worth 
25 mentioning that ligation reactions lacking cre insertion resulted in 50 to 100 

times less blue colonies than obtained with the ligations with cre insertion. This 
phenotypically blue background of empty pBAD33 was subtracted before 
calculating the presented data. 

Leung et al.. Technique, 1:11-15 (1989) reported that the frequency of 
30 point mutations created by error-prone PCR is about 0.5%. If this is true, in 
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average five point mutations should occur in each 1 kb ere coding sequence 
subjected to an error-prone PCR. By extrapolating this data to the three times 
less P" enzymes after a non-mutagenic PCR, one can conclude that the frequency 
for point mutations should also be reduced by a factor of three to 0.18%. 
Experiments made by Zhou et. al. (1991) showed 11% of a 633 bp marker gene 
phenotypically impaired after non-mutagenic PCR. About 37% of all genes in 
the pool, however, carried at least one point mutation. Even though the 
conditions for the non-mutagenic PCR were similar, the observed discrepancy 
between 11% and 20% of phenotypical mutants may be due to a variety of 
reasons, among which: (i) the size difference between the two genes (633 bp vs. 
1020 bp), (ii) different elongation times during PCR, and (Hi) different 
sensibility of the two proteins for disabling point mutations. 
Testing wt Cre on loxKl^ vs. loxKl^ Substrates 
The level ofloxKl and loxK2 recombination due to wt cre expression 
was determined using the wt Cre control plasmid pBS606. After transformation 
of the cell lines BS1493 and BS1494 with pBS606 and 2.5 h and 4 h of cre 
expression, cells were grown on selection and non-selection plates (as described 
previously). The recombination frequency between the ahered lox^ sites was 
considered equal to the observed frequency of Kan^ phenotype: for loxKl, it 
was about 10^^ after 2.5 h and 2 X lO"^ after 4 h of wt cre expression, for loxK2, 
it changed from about 2 X 10"^ to 2 X 10"^ All colonies found were white, 
indicating effective loxP recombination by wt Cre within the allowed expression 
time. This result shows that long term expression of the wt enzyme permits a 
sHght increase in recombination between the altered lox sites. The use of 
pBAD33 to avoid background lox recombination by suppressing cre expression 
is therefore justified. Because loxK2 was eventually 100 fold better recognized 
by wt Cre than /oxKl (but still at low frequency), first the creation of novel Cre 
recombinases with loxKl specificity was attempted. 
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Mutagenesis on loxKl 

First Rounds of Directed Evolution 

The result of the first four rounds of the described mutagenesis 
procedure on /oxK2 with selection plasmid pBS584 are presented in Table 6. 
The following symbols are used to describe the status of the DNA shuffling 
procedure for ere: 'V' indicates a non-mutagenic PGR, "m" a mutagenic PGR, 
and ''x" stands for the in vitro reassortment event. For example, mxoxox ere 
represents a ere pool subjected to three rounds of the directed evolution process, 
with a mutagenic PGR followed by in vitro shuffling in the first, and non- 
mutagenic PGRs and shuffling as mutagenic and recombinogenic events in the 
following two rounds. The phenotypically blue background due to empty 
pBAD33 was subtracted from all results by control ligations without ere 
insertion. In every round, error-prone and non-mutagenic PGR served as the 
necessary mutagenic event on the template pool of the previous round. After in 
vitro reassortment and selection, always the larger Kan^ population of the two 
parallel experiments was chosen as template for the next round (as indicated in 
the last column of Table 6). Only in the first round the error-prone PGR could 
lead to more candidates, whereas in all following rounds the reduced mutagenic 
frequency of the non-mutagenic PGR turned out to be more beneficial. The 
density of point mutations resulting out of two mutagenic PGR's was obviously 
too high to allow efficient elimination of deleterious mutations from 
advantageous ones during the in vitro shuffling step. This is confirmed by the 
high frequencies of blue colonies found within any pool in any round subjected 
to mutagenic PGR's twice. Error-prone PGR in the context of the applied 
selection therefore appears to be useful in the first round only, where its three 
times higher mutagenic fi-equency increases the amount of beneficial mutations 
compared to a non-mutagenic PGR. With increasing cycle numbers non- 
mutagenic PGR's should be preferred to avoid high densities of deleterious 
mutations. 
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The established directed molecular evolution process allows effective 
evolution of ere. Column five of Table 6 shows that with every round the 
number of Kan^ colonies is increasing, while the time for ere expression could 
be lowered from 4 h to 25 h (column three). After only three rounds, Cre 
mutants capable for /oxK2 recombination at decreased concentrations due to the 
reduced expression time were found. However, it was not possible to isolate 
any blue colony on the selection plates. All identified KI^ mutants therefore are 
also P"^. As mentioned above, high densities of deleterious mutations in the ere 
pool subjected to error-prone PCR's twice could explain why no blue colonies 
were seen on selection plates, even with over 90% of P' candidates on non- 
selection plates. 

Evaluation of Six K2''/F^ cre Mutants 

Functional Test 

As indicated in Table 6, 36 white Kan^ colonies could be isolated from 
the mxoxox cre pool after only 2.5 h of ere expression with an 1/5 dilution 
grown on selection plates. This result indicated that competent Cre mutant 
capable of /oxK2 recombination were produced. Six were selected for further 
analysis. After elimination of the selection plasmid from the minipreps (as 
described in Materials and Methods), all six of them, ^s well as wt Cre (pBS606) 
were subjected to the described fiinctional test on /oxK2, but also on loxKl and 
loxP recombination with plasmids pBS584, pBS583 and pBS613. The resuhs 
are presented in Table 7, 

Briefly, by selecting for /oxK2 recombination, all mutants except 
mxoxox 4, showed significantly increased percentages of Kan^ (between 3% 
and nearly 70%), compared to wt Cre (0.002%>), as indicated in column three of 
Table 7. This indicates a lO"^ to over 10^ fold increase in activity on /oxK2. 

On /oxP, all (including mxoxox 4) showed recombination frequencies 
between 80% and 100% after 2.5 h of cre expression (column 4). This was 
expected from the results obtained with the X-gal screen for lox? recombination 
during the selection procedure. 2.5 h of induction for wt ere expression is 
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therefore sufficient for almost complete lox? recombination, justifying 2.5 h of 
expression of mutant ere pools for selecting competent Kl^ Cre mutants. The 
observed slight decrease in lox? recombination with the mutants mxoxox 3 to 6 
either derived from usual variations during experiments, or may indicate a 
slightly reduced lox? activity. With BS1541 blue colonies on both selection and 
non-selection plates were found in approximately the same frequency (2% to 
20%) as kanamycin sensitivity (Kan^). This indicates competition between the 
lox?\lacZ] site on the genome and the lox?\EGFP-rrnBT 1T2] sites on pBS613. 
Since the same Cre mutants never resulted in blue colonies during selection for 
loxKl recombination in cell line BS1494, it is possible to conclude that loxV is 
still preferred over loxKl, This argument is supported by higher frequencies of 
lox? recombination, close to 100%, compared to the loxK2 recombination 
frequencies of 3% to 70%. 

All frequencies found for loxKl recombination, determined by using cell 
line BS1493, lie below 0.01% after 2.5 h, as well as after 4 h of induction of cre 
expression (column five). This indicates that no analyzed mutant developed an 
increased activity on lox Kl compared to wt Cre. 

To summarize, five out of the six analyzed mutants showed a significant 
decrease in specificity, resulting in the possibility for lox? and loxKl 
recombination. 

Sequencing 

The six described mutants have been completely sequenced in both 
directions to determine the mutations which lead to the observed decrease in 
specificity. The resulting aligned cre coding and aa sequences of all mutants 
and wt Cre are represented in Tables 1 0 and 1 1 . Each mutant showed between 3 
and 8 point mutations, altogether 3 1, as listed in coIuoti two of Table 8. The 
overall mutagenic frequency can therefore be calculated at 0.5% (3 1 mutations 
in 6 clones of 1020 bp), which is similar to only one round of error-prone PCR 
(Leung etal., Technique, 1:11-15 (1989)). The reason for the low frequency of 
point mutations after three rounds of the mutagenesis procedure (i.e. nine 



9973 5 Sv I 



74 



OMRFi78 
20487/247 



PCR's) is the applied selection after each round, ere mutants with low density of 
point mutations seem to be favored by the stringent kanamycin selection, 

26 of the 3 1 identified point mutations resulted in aa changes compared 
to the wt sequence, as indicated in column three. No deletions or frame-shift 
5 mutations, as well as no codons affected by more than one point mutation at the 
same time could be identified. All possible transition events could be observed, 
but only half of all possible transversion events (sho^vn in Table 12). Adenine 
to guanine and vice versa transition events represented almost 30% of all 
identified point mutations. All other events occurred less frequent (10%, 7%, or 

1 0 never). The represented statistic may however be biased, either since only six 
mutants were analyzed, or due to the directed molecular evolution technique 
itself: The types of point mutations observed less often, may more frequently be 
deleterious in ere and were consequently removed from the pools. With more 
mutants to be sequenced, this question could be addressed further. 

1 5 Only one point mutation, a transition event fi-om adenine to guanine at 

position 785 in the ere coding sequence is common for all 5 mutants with 
remarkably increased /oxK2 activity. This mutation leads to a replacement of a 
glutamate residue at position 262 in the J helix of wt Cre by a glycine (indicated 
in column five of Table 8). This glutamate is believed to contact the lox? 

20 sequence at positions 1 1 or 12 with its acidic side cham (Guo et al., Nature, 
389:40-46 (1997)). Another point mutation, resulting in a conservative 
threonine to serine exchange at position 316 was identified in three mutants. 
Five point mutations were found independently in two of the six mutants, among 
which two types of silent mutations. Finally, eleven point mutations occurred 

25 only once (indicated in column four). Therefore, the critical mutation for loxKl 
activity appears to be E262G. Some of the additional mutations could be 
responsible for the observed ten fold difference in loxK2 recombination 
frequency among the five E262G carrying mutants. 
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Site-Directed Mutagenesis to Verify Results 
Site-Directed Mutagenesis Procedures 

To determine whether the E262G mutation alone is responsible for the 
increase of /oxK2 activity of the Cre mutants by at least a factor of 10^, two 
different experiments were made: 

First, the described directed evolution procedure was repeated in three 
different sets on wt cre, by adding three 5' phosphorylated mutant 
oligonucleotides prior to reassembly: Incorporation of the first oligonucleotide 
(BSB465) into cre should lead to the E262G mutation, incorporation of the 
second one (BSB467) to a E262A mutation, and the equimolar mixture of 
random oHgonucleotides (represented by BSB469) to 20^ 8000) possible aa 
combinations at positions 261 to 263 of Cre. According to Stemmer, W. P. 
Proc. Natl Acad Scl USA, 91:10747-10751 (1994), these oligonucleotides 
should be incorporated during reassembly and cause the desired mutations at a 
frequency of about 8%. After insertion of the resulting cre pools into pBAD33 
and 2,5 h of expression, 0.8% of white Kan^ colonies were found with BSB465. 
Therefore, the fi-equency of the Kan^ phenotype due to loxKl recombination is 
ten times lower than the expected frequency for the E262G mutation to occur. 
This indicates that the E262G mutation increases the specificity for loxKl from 
0.002% of wt Cre (see above) to approximately 10%) of recombination during 
the standard expression experiment (2.5 h of cre expression prior to selection). 
By using the second oligonucleotide (BSB467) only 0.09% of the total white 
colonies showed Kan^. Thus, E262A still favors lox K2 recombination but by 
almost a factor ten less efficiently than the E262G mutation. With the random 
oligonucleotide mixture (BSB469), the frequency of /oxK2 recombination 
shrunk to 0.02%. Compared to a control experiment with no oligonucleotides 
added, resulting in 0.003% Kan^ (consistent with the results obtained below), 
still some of the possible 8000 aa combinations at positions 261 to 263 of Cre 
are expected to favor /oxK2 recombination. Since the frequency of blue 
colonies on non-selection plates was about 30%, a clear indication for additional 
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mutations, it cannot be completely excluded that some of the occurred Kan^ 
mutants of this experiment have been carrying some additional beneficial 
mutations. Nevertheless, this experiment indicated that the E262G mutation is 
probably the basis for significant increase in loxK2 activity. 

For better defined mutations, a second experiment was done: The same 
oligonucleotides, now in sets for both DNA strands, were used with the 
Stratagene QuckChange™ Site-Directed Mutagenesis Kit as described in 
Materials and Methods. After transformation of the loxKl selection strain 
BS1494, the percentage of white Kan'^ colonies in the three different 
experiments could be determined as 6.2% for E262G, 0.8% for E262A and 0.6% 
for the 8000 different ten (10) days combinations as positions 261 to 263. Due 
to the QuickChange'^^ procedure, which eliminates parental DNA, one could 
expect that the desired mutations occurred in at least 50% of all clones of the 
three different pools. This assumption indicates that the E262G mutation alone 
results in approximately 12% or less recombination frequency on loxK2 (6% out 
of 50% or more carrying the mutation) and the E262A exchange in 1.6% or less 
(under standard conditions). This calculation is not valid for the third set of 
oligonucleotides, because no defined mutation is introduced but rather a mixture 
of 8000 different ones. Blue colonies were not found during this experiment, 
confirming that the frequency of additional mutations altering Cre activity on 
loxP was very low. 

The results of both experiments indicate consistently, that the E262G 
mutation alone is sufficient to increase loxKl recognition by approximately a 
factor of 1 0^ compared to wt Cre. The ten times higher frequency observed with 
three of the six analyzed mutants after three rounds of the mutagenesis 
procedure can only be explained with additional beneficial mutations. The 
E262A mutation increases the fi-equency of loxK.2 recombination also, but 
approxunately by a factor of ten less effectively than E262G. 
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Sequencing and Elimination of Possible Additional Mutations 

Two white Kan^ colonies of each pool derived from the site-directed 
mutagenesis procedure were selected and the entire ere gene was sequenced. 
Both E262G and E262A candidates showed the desired sequence with no 
additional mutations. The random candidates surprised: One of them did not 
reveal any point mutation, representing an artifact which managed to survive the 
selection, whereas the other one showed nucleotide alterations from position 783 
to 786. The wt sequence (ysiCTG GAA786) was found to be changed to (733 CTT 
TGG786), resulting in one silent mutation, conserving L261, and a E262W 
exchange. To exclude possible mutations in pBAD33 due to the PGR involved 
site-directed mutagenesis procedure, the three defined ere mutants (E262G, 
E262A, and E262W) were excised by Hind III andXba I and reinserted into the 
MCA of fresh pBAD33. 

Functional Testing 

The three defined Ore mutants for the amino acid position 262 were 
subjected to a functional test for loxK2, loxKl and lox? recombination activity, 
as mentioned before. The results are summarized in Table 9. First, the results 
described previously for the E262G and E262A mutations were confirmed: As 
indicated in column three, the /oxK2 recombination frequency increased 10^ fold 
with the E262G mutant compared to the wt enzyme, whereas the E262A mutant 
shows only an increase of 200 fold. Surprisingly, the E262W mutant also 
achieved a similar activity on /oxK2 as seen with the E262G mutant. The test 
for lox? recombination frequency with cell line BS1541 (column four) showed 
that the ability for loxP recognition is at best slightly impaired by the three 
different mutations (as already seen with the analyzed mxoxox mutants). Again, 
blue colonies could only be found with this cell line, indicating that lox? is 
preferred over loxK2 (as described before). As expected, none of the three 
mutants performed significantly better on loxKl than wt Cre (column five). To 
conclude, this final experiment provides the necessary evidence that the E262G 
mutation presents the basis for the observed decrease in specificity of Cre. 
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However, additional mutations seem to be helpful to increase the newly obtained 
activity further. In addition, glycine at position 262 is not the only possible 
residue to permit a remarkable increase in loxK2 activity. 

DISCUSSION 

K2^/P^ Cre Mutants 

The chosen random mutagenesis procedure linked to the described 
selection in E, coli allowed the identification of Cre mutants, characterized by a 
wider substrate recognition. The evolved enzymes showed wt-like activity on 
loxV sites and in addition had almost the same activity on altered lox^ sites, 
referred to as /oxK2. By contrast, the wt enzyme showed only marginal activity 
on loxKl^ substrates. 

lox Sites 

lox? and ioxK2 differ at several locations as illustrated in Figure 1 : First, 
the three outermost bp of the inverted repeats are altered, facilitating the 
construction of the various plasmids used for selection. Second, the entire non- 
canonical 8 bp spacer is completely exchanged, and third, two transversion 
events (thymine to adenine) are introduced at positions 1 1 and 12 of the lox site. 
Only the two transversion events are considered important for inhibiting wt Cre 
from recognizing the site. Other investigations have shown before, that the two 
alterations of lox? mentioned first are without inhibiting effect on wt Cre (B. 
Sauer, unpublished results). The design of loxKl was based in part on so-called 
cryptic lox sites, that were identified previously in the yeast genome (Sauer, B., 
J. Mol Biol, 233:91 1-928 (1992)). Another consideration was the choice of a 
good starting sequence for the described mutagenesis procedure. Starting with 
sites that contained several and/or widespread alterations oilox? was avoided, 
because the greater the number of alterations in the substrate, the more the 
enzyme would have to be altered. Consequently, to most effectively use the 
mutagenesis procedure, the two sites presented in Figure 1, loxKl and loxK2, 
were designed to have only two critical alterations. In initial experiments, wt 
Cre was found to recombine loxYJ} substrate pBS584 slightly better than 
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loxKl^ pBS583. This difference may depend on the fact that in loxK2 the two 
alternations are located next to each other, while in loxKl they are separated by 
3 bp (positions 14 and 10). Thus, /oxKl could interfere with wt Cre binding at 
two distinct DNA-protein interaction sites as compared to /oxK2, where only 
one location of incompatibility is available. For this reason, /oxK2 was chosen 
for the initial set of experiments, described in this work. 

Cre Mutants after Three Rounds of Directed Evolution 
Three iterations of the in vitro evolution procedure were necessary to 
identify 36 candidates, expressing Cre mutants that could process loxK2 (based 
on the applied selection procedure in E, coli). Tests showed that five out of six 
selected ones had 10^ and lO"^ fold increased activity on loxKl when compared 
to wt Cre. On lox? and /oxKl, however, there was almost no difference 
between wt and the mutant enzymes. The mutants therefore had developed an 
increased tolerance from transversions at positions 1 1 and 12 (loxKT) of the lox 
sequence, but not for other positions like 10 and 14 (/oxKl). To indicate this 
phenotype they were referred to as K2VP'^. 
The E262G Mutation 

Sequence analysis identified that the five mutants with remarkably 
enhanced /oxK2 activity (10^ to 10^ fold compared to wt Cre) had in common 
only one point mutation, leading to the aa change E262G. Site-directed 
mutagenesis experiments confirmed, that the E262G mutation is sufficient to 
increase /oxK2 activity by a factor of 10^ compared to wt Cre. 

Based on the recently described crystal structure (Quo et al, Nature, 
389:40-46 (1997)) glutamate at position 262, located in the J helix of the 
enzyme, may be a DN A contacting residue and permit the formation of a 
hydrogen hond between the carboxyl group of its side chain and an amino group 
of one of the two adenines at positions 11' or 12' in the /oxP sequence (Figure 
1). However, changing these two bases to thymines in the /oxK2 sequence 
should lead to an electrostatic repulsion between their oxygens and the acidic 
side chain of glutamate. This could explain the observation that wt Cre is 
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unable to catalyze a recombination between two loxK2 sites. Exchanging 
glutamate for a glycine, that does not have a side chain, should remove the 
electrostatic repulsion and thereby permit loxKl binding. On the other hand, 
this alteration could affect lox? binding because an electropositive DNA-protein 
5 interaction may be lost. Results from the mentioned experiments support this 
proposition: The E262G mutation alone lead to an increase in loxK2 
recombination from 0.002% to about 2% and to a slight decrease in lox? 
recombination from 94% to about 80%. The 50 times lower frequency of /oxK2 
compared to lox? recombination may depend on the second thymine or more 

10 likely on the complementary adenine (position 1 1 in loxK2) that could 

contribute to sterical repulsion between the J helix and the loxKl site. To prove 
this hypothesis, the role of arginine at position 258, located one helix turn away 
from the glutamate, should be further investigated by site-directed mutagenesis. 
As proposed by Guo et al., Nature, 389:40-46 (1997), R258 is a DNA contacting 

1 5 residue that forms hydrogen bonds with the guanine at position 1 0' of lox? and 
may also interact with the bp at position 1 1 . There is yet no confirmatory 
experimental evidence for this proposal. 

Results from three initially isolated mutants (mxoxox 1 to 3) indicated 
about 50% of recombination frequency on loxK2. This is about ten fold higher 

20 than that obtained with the E262G mutation alone. It is therefore likely that 

some of the additional point mutations identified in these three mutants account 
for this increase in activity. Table 8 lists all point mutations that were found. If 
silent and conservative mutations are considered not to influence specificity, 
only a limited number of candidates to account for the phenotype remains. 

25 Among these, S254G and Q255R of mxoxox 2 and 3, because of their location 
close to the amino-terminus of the J helix, could be expected to influence DNA 
contacts with positions 11 or 12 of the lox site. The other mutations are 
scattered in the N- and C-terminal domain of Cre. All, except RIOIQ of 
mxoxox 5, affect aa that are not located within the proximity of DNA contacting 

30 areas. Some appear independently in two mutants, e.g. D29A or Dl 89N, that 
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could influence protein folding or the interactions among the four Cre enzymes 
necessary for recombination. Such alterations could influence for example the 
orientation of the J helix and thereby reduce remaining interference between the 
/oxK2 site and the enzyme. Alternatively, some mutations, also silent ones, 
5 could influence protein expression, leading to a faster- accumulation of enzymes 
and consequently to higher recombination frequency. This possibility should 
however also influence lox? recombination. In fact, after 2.5 h of cre 
expression, the mutants mxoxox 1 and 2 showed a slightly higher frequency of 
loxP recombination compared to wt cre (98% vs. 94%). This difference, on the 

10 other hand, may be attributed to the variations that occur normally within 

experiments. To address this question further, shorter cre expression times on 
lox? would be required. 

When the coordinates of the crystal structure (Guo et al, Nature, 389:40- 
46 (1997)) are available (protein data bank, Brookhaven National Laboratory), it 

1 5 will be possible to confirm many of the tenets discussed. 

Finally, the increase in loxK2 tolerance between the E262G mutation 
alone and the isolated mutants carrying additional point mutations justifies the 
use of the DNA shuffling procedure linked to selection: Not only has it 
permitted the elimination of deleterious mutations from the sequence pool, but it 

20 helped to accumulate various more or less beneficial aa alterations as well. 
The E262A and E262W Mutations 

The mentioned site-directed mutagenesis procedure was used to generate 
two more defined Cre mutants, E262A and E262W. Compared to the E262G 
mutatiSon, E262A permitted loxKl recombination ten fold less effectively. The 
25 E262W mutation however resulted in similar activity on /oxK2. 

The aliphatic side chain (a methyl group) of A262 could be the reason 
for slight sterical interference. This would explain the observed reduced 
frequency of /oxK2 activity with E262A. lox? recognition, however, could not 
be found to be affected compared to E262G. The lowered loxK2 activity 
30 explains why no E262A mutation was identified in the small pool of six 
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analyzed mutants: With a ten fold decrease in activity, one would expect to 
encounter the corresponding mutation ten times less often during selections as 
well. 

In contrast to the small side chain of alanine, the large aromatic side 
5 chain of W262, was expected to inhibit recombination between any lox sites due 
to sterical interference. Surprisingly, this seems not to be true. A possible 
explanation for the observed activity on loxV and /ojcK2 could be that the 
aromatic and planar structure of the tryptophan side chain, fits better into the J 
helix - lox interface than does a methyl group. Different influences of A262 and 

10 W262 on folding of the J helix could also contribute to the observed phenotypes. 
The reason why the E262W mutation was not identified among the six analyzed 
mutants is the genetic code. Whereas for the E262G and E262A mutations only 
one bp in the glutamate encoding GAA codon needs to be mutated to GGA or 
GCA, for E262W the whole codon must be exchanged to TGG. This is unlikely 

15 to occur during the applied random mutagenesis procedure with a mutation 

frequency of 0.5%, Other amino acid changes due to two or three mutations of 
the glutamate encoding codon therefore cannot be considered to have occurred 
during the random mutagenesis procedure. 



20 interesting to confirm and further investigate the discussed hypothesis. 



When the coordinates of the crystal structure are public, it will be 
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5-BI-ara a.k.a Z-ara 



5-bromo-3-indoyl-a-L-arabinofuranoside 



aa 



amino acid 



Ap(Ap^Ap') 



ampicillin 
base pairs 

chloramphenicol (resistant, sensitive) 
cyclization recombination {gene - Protein) 
Dalton 



25 bp 



Cm(Cm^, Cm^) 



ere ~ Cre 



Da 



DMSO 



dimethylsulfoxide 



30 



ds 



double stranded 
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GF gel filtration 

Kl"^ or K2^ Cre mutant capable for loxKl or loxKl 

recombination 

Kan (Kan , Kan kanamycin (resistant, sensitive) 

5 lox locus of crossover 

MCS multiple cloning site 

mRNA messenger RNA 

neo - Neo neophosphotransferase (gene - Protein) 

P" or P"^ Cre mutant defective or capable for lox? 

10 recombination 

PGR Polymerase Chain Reaction 

SD Shine-Dalgamo sequence 

ss single stranded 

TK thymidine kinase 

1 5 wt wildtype 

X-gal 5-bromo-4-chloro-3-indolyl-P-D-galactopyranoside 



Example 2: Analysis of Variant Cre Recombinases 

This example describes analysis the activity of several specific variant 
20 Cre recombinases. 

Vectors 

pBS606, 614, 626, 627, 628 and 650: pBAD33 with wt, E262G, 
E262G/D29A, E262G/D189N, E262G/T316S, and R3M3 cre insertion used for 
expression of the corresponding Cre proteins in DH5a for in vivo testing. 

25 pBS632 to pBS641 : pUC 1 9 based plasmids for in vivo tests of different 

Cre mutants on a variety of different lox sites and combination of lox sites, all 
bearing the FASl spacer (Sauer B., Nucleic Acids Res., 24:4608-4613 (1996)). 
Recombination between two lox sites leads to excision of a neo cassette to give 
kanamycin sensitive E. coli. The same plasmids were also used for in vitro 

3 0 recombination experiments . 
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pRH200\ wt Cre expression plasmid (a generous gift from Ron Hoess, 
DuPont, Wilmington, DE) used to overexpress wt Cre in BL21(DE3) (Novagen, 
Madison, WI) strain. 

pBS654 to pBS658: wt cre of pRH200 was replaced with different cre 
mutants (E262G, E262G/D29A, E262G/D189N, E262G/T316S, and R3M3) 
using Age I and Ml/u I restriction sites. 

E. coll Strains 

BS583: The E, coli strain BS583, DH5 Mac QJD69 loxP\lacZLEm]\ 
was chosen as the bacterial background for the selection procedure using 
plasmid pBS584. Due to the loxP\lacZ\ containing X prophage, Cre activity on 
loxP can be evaluated simply by using X-gal plates. 

BS1494: The E. coli strain for selection was established by introducing 
the selection plasmid pBS584 into BS583. Thus, BS1494 allows a kanamycin- 
selection for loxK2 and in parallel a blue/white-screen for loxP recombination 
with 5-bromo-4chloro-3-indolyl-p-D-galactopyranoside (X-gal). Note that the 
spacer region of the loxK2 site (FASl) is different from the original one of loxP. 
Thus, recombination events between loxP of the X prophage and loxK2 of the 
selection plasmid pBS584 catalyzed by potent Cre mutants are excluded in 
BS1494. 

BS1576 to BS1581: For the in vivo recombination experiments wt and 
mutant Cre expressing strains were generated by introducing plasmids pBS606, 
614, 626, 627, 628, and 650 into DH5a. 

Transformation of E. coli 

For plasmid transformations of E. coli strains, electroporation was 
preferred over chemical protocols. Electrocompetent cells were made and used 
for electroporation as described by Smith et al, Focm, 12:38-40 (1990), The 
appropriate cell porator and cuvettes were from Life Technologies (Bethesda, 
MD). 
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Site-Directed Mutagenesis 

The QuickChange™ Site-Directed Mutagenesis Kit (Stratagene Cloning 
Systems, La JoUa, CA) (1997) was used to generate defined single and double 
mutations in the ere gene. 
5 Overexpression and Purification of Candidate Mutants 

Wild-type and five different mutant Cre proteins were overexpressed 
using plasmids pRH200, and pBS654 to pBS658 in conjunction with Novagen 
B121 (DE3) cells. After induction for 2.5 h cells were harvested, sonicated, and 
Cre partially purified after DNase I digest with a single step Whatman(D PI 1 
1 0 resin (Whatman Inc., Fairfield, NJ) as described before by Wierzbicki et al., 1 
Mol Biol, 195:785-794 (1987). The obtained Cre preps were about 80% pure 
and protein concentrations ranged between 100 and 200 ng/)il. 

Mutant Analysis 

In Vivo: Plasmids pBS632 to pBS641 were transformed into Cre- 
1 5 expressing E. coli strains BS 1 576 (wt), BS 1 577 (E262G), BS 1 578 

(E262G/D29A), BS1579 (E262G/D189N), BS1580 (E262G/T3 1 6S), and 
BS1581 (R3M3). After 1 hour of induction of cre expression with 0.2% L-(+)- 
arabinose, 10'^ dilutions were plated on non-selecfion medium containing 0.2% 
D-glucose. After overnight incubation at 37^C, colonies were transformed to 
20 kanamycin-selection plates for the described negative selection for neo excision. 

In Vitro: Purified Cre mutants were used for both in vitro recombination 
and gel retardation experiments as described before by Sauer B., Nucleic Acids 
Res., 24:4608-4613 (1996) and Wierzbicki et al, J. Mol Biol, 195:785-794 
(1987), respectively. For the recombination reactions plasmids pBS632 to 
25 pBS641 served as substrates, whereas for the DNA binding reactions y['^^P]- 
dATP (Amersham Pharmacia Biotech, Piscataway, NJ) end-labeled 35 bp 
oligonucleotides were used, each encoding a /ox-halfsite and one half of the 
FASl spacer. 
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Evaluation of Six K2^fP^ ere Mutants 

Six of the 36 identified single Kan^ colonies of the third round were 
chosen for further analysis and referred to as R3M1 to 6 (Round 3 Mutants 1 to 
6). Retesting them in the indicator strain revealed that all but one (R3M4) show 
significant loxK2 recombination and all are unbiased in their activity on loxP 
(Table 14). Sequencing analysis revealed one amino acid change common to all 
five mutants having increased loxK2 activity: a glutamate to glycine exchange at 
position 262 (E262G) in the J helix of the Cre protein (Figure 13). A second 
point mutation, a conservative threonine to serine exchange at position 316 
(T3 16S), was identified in three of the mutants with enhanced loxK2 activity. 
Two non-conservative mutations (D29A and D189N) were found in two of the 
five mutants. In addition, ten mutations occurred only once. Therefore, the 
critical mutation for loxK2 activity appears to be E262G. 

Site-Directed Mutagenesis 

To address the question of the influence of the different point mutations 
further, the QuickChange™ Site-Directed Mutagenesis Kit (Stratagene) was 
applied to generate the following Cre mutants, each confirmed by sequencing: 
E262G, E262G/D29A, E262G/D189N, and E262G/T316S. 

In Vivo Characterization 

To elucidate the contribution of specific amino acid changes in 
conferring altered recombinational specificity to Cre, recombination assays with 
different lox sites were carried out with the following Cre enzymes: the wt 
enzyme, one of the originally sequenced third round mutants (R3M3), and the 
generated single and double mutants. All lox sites used for the in vivo tests were 
designed to have the same 8 bp spacer region (FASl) so that recombinational 
specificity was completely dependent on Cre's recognition of the symmetrical 
inverted repeats of the lox sites. Note that wt Cre-dependent recombination 
between loxP sites bearing the FASl spacer does not differ from recombination 
between original loxP sites (Sauer B., Nucleic Acids Res,, 24:4608-4613 
(1996)). 
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Figure 14 presents the recombination frequencies of various mutant lox^ 
substrates and combination of sites QoxP with loxK2 and as control with loxKl) 
from the marker excision assay. Mutant lox sites with symmetric nucleotide 
substitutions at positions 1 1 and 12 of the loxP sequence were tested with the wt 
5 enzyme and the five variant Cre mutants, including the multiple mutant 

R3M3(A). All enzymes showed a maximum in recombination (close to 100%) 
with thymines at these positions, i.e. with loxP^, Adenines, i.e. loxK2^, lowered 
the recombination frequencies drastically for the wt enzyme, whereas the single 
and double mutants performed approximately 50% to 70%) less effectively. 

10 R3M3, however, showed nearly loxP-like activity on the loxK2^ substrate, as 
seen before with the selection strain (Table 14). Altering the two thymines to 
guanines resulted in similar recombination frequencies as seen with adenines at 
these positions. Cytosines, on the other hand, did notresuh in similar 
recombination frequencies as seen with thymines, but were surprisingly the least 

15 efficiently recognized substitutions of positions 1 1 and 12. To conclude, it 
appears that the E262G mutation is necessary and sufficient to significantly 
increase recombination frequencies on lox sequences which are symmetrically 
altered at positions 1 1 and 12. Of the additional mutations tested, D29A seems 
to be sHghtly beneficial, whereas D189N and T3 16S appear indifferent or even 

20 slightly deleterious for recombination on the variant sites. Thus, additional 
mutations of R3M3 (Figure 13) must be responsible for its further increased 
performance on the mutant substrates in vivo. 

Figure 14B shows the observed recombination frequencies on mixed 
substrates (e.g. loxP with loxK2), For both loxK2 and the control substrate, 

25 loxKU recombination with loxP by wt enzyme was substantially less than for 

loxP^ recombination. This recombination frequency was increased dramatically 
with all of the mutant Cre protein. These results hint that not only a 
cooperativity in binding of two Cre molecules to one lox site exists but 
moreover also cooperativity between Cre molecules binding to different sites 

30 which then are synapsed and recombined. This finding is especially useful for 
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genomic targeting: It suggests that a targeting vector carrying a loxP site will be 
effectively recombined with the endogenous /ox-like site by the Cre mutants as 
long as the spacers are compatible. 

Noteworthy also is R3M3-Cre's increased ability to recombine loxKl by 
5 itself, whereas all the mutants, like wt, did not. Again, some additional 
mutations found in R3M3 (Figure 13) seem to be responsible not only for 
increased recombination frequencies on loxK2 but alao on loxKl compared to 
Cre bearing the E262G mutation alone or in conjunction with the D29A 
mutation. 

10 /« F/Yra Characterization 

In Vitro Recombination Assays, For the in vitro recombination 
experiments the same substrates as in vzvo-that is, pBS632 to pBS641-were 
used. An in vitro recombination experiment using wt, E262G, and R3M3 Cre 
preps on loxP^' loxK2^, and loxKf substrate showed that, as seen in vivo before, 

15 wt Cre is capable to recombine loxP^ substrates only. No recombination 
products are visible when loxKl^ or loxKl^ substrates were used. By 
introducing the E262G mutation into Cre, however, recombination of loxK2^ 
substrates becomes possible at elevated frequencies, and even for loxKl^ 
substrate recombination products become weakly visible in vitro, E262G-Cre 

20 activity on the loxKl control site in vitro but not in viyo probably derives from 
differences in ionic strength and/or enzyme concentration between the assays. 
Finally, with R3M3 Cre the ability for both loxK2^ and loxKl^ recombination is 
further increased, as expected from the in vivo assays. As mentioned in the in 
vivo results before, guanines at positions 1 1 and 12 of the lox sequence were 

25 recognized at similar frequencies as seen with adenines (i.e. loxK2), whereas 
cytosines were clearly less tolerated by all Cre mutants. Slight differences 
between in vivo and in vitro recombination frequencies are probably due to 
differences in ionic strength, temperature, DNA condensation, enzyme 
concentration, etc. In general, the pattern of the in vitro recombination 
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frequencies of the different Cre enzymes on the different lox sites mirrors the 
one seen in vivo, 

Gelshift Experiments, Gelshift experiments were applied to address the 
question of in vitro DNA-affmities of the different Cre mutants. As expected 
5 from the previous results, all three Cre enzymes bind with similar efficiency to 
loxP, whereas to loxK2, only E262G and R3M3 show binding affinity. 
Surprisingly, R3M3 binding appears less efficient on the loxK2 half-site than 
binding of E262G, and on the loxKl half-site only E262G does bind weakly 
whereas R3M3 does not. 
10 DISCUSSION 
lox sites 

loxP and loxK2 differ at several locations as illustrated in Figure 1 : First, 
the three outermost bp of the inverted repeats are altered (positions 15, 16, 17, 
15', 16' and 17'). Second, the entire non-canonical 8^ bp spacer is completely 

1 5 exchanged (positions 4 to 4'), and third, two transversion events (thymine to 
adenine) are introduced at positions 1 1 and 12 (and mirrored at 1 T and 12') of 
the lox site, mimicking potential recombination targets in eukaryotes. Only the 
two mutations at positions 1 1 and 12 are considered important for inhibiting wt 
Cre from recognizing the site. Thus, they were the only alterations in the loxi 

20 sites used for the in vivo and in vitro characterization experiments. Other 

investigations have shown that the two alterations of loxP mentioned first are 
without inhibiting effect on wt Cre (Sauer, B., Mol Cell Biol, 7:2087-2096 
(1987) and Sauer B,, Nucleic Acids Res., 24:4608-4613 (1996)). Noteworthy is 
however that the altered 8 bp spacer region (FAS 1 spacer) does not allow loxP - 

25 IoxK2 recombination, since the regions where the single-strand cleavages and 
exchanges take place are not compatible. To allow simultaneous monitoring of 
Cre-mediated recombination both at the wt loxP^ and at a mutant lox^ substrate 
(loxK2^\ incompatible spacer elements were used to prevent recombination 
between the two types of lox sites by a candidate Cre mutant with altered 

30 specificity. Such recombination might easily compromise ready detection of 
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desired Cre specificity mutants. Incompatible spacers (original loxP and FASl) 
formed the basis for the simultaneous selection for loxK2 recombination and 
screen for loxP recombination with E. coli strain BS1494 which led to the 
disclosed variant Cre recombinases. 
5 LoxKl, the other lox sequence used in this study, bears two critical bp 

exchanges per arm as well, however at different positions: 10 and 14. It was 
used as a control lox site, addressing the question whether the generated Cre 
mutants with novel specificity for loxK2 can also tolerate adjacent alterations 
within the lox sequence. 

10 Cre Mutants after Three Rounds of Directed Evolution 

Three iterations of the in vitro evolution procedure were used to identify 
36 candidates which express Cre mutants that could process loxK2 (based on the 
selection), as well as loxP (based on the simultaneous screen), wt Cre, on the 
other hand, only shows marginal activity on loxK2 when expressed at very high 

15 levels. 

Characterization 

Sequence analysis revealed that five mutants with significant loxK2 
activity had in common only one point mutation, leading to the amino acid 
change E262G. However, several other point mutations occurred independently 

20 twice or thrice, among which D29A, D189N, and T316S. To investigate the 

influence of the mentioned mutations on the observed phenotype, specific single 
and double mutants were generated by site-directed mutagenesis. In vivo and in 
vitro assays were then carried out with vrt-Cre and five different mutants 
(R3M3, E262G, E262G/D29A, E262G/D189N, E262G/T316S) to compare their 

25 performance on a variety of different alterations of the loxK2 site. 

The in vivo and in vitro recombination assays showed a similar pattern in 
recombination frequencies for the different enzymes on the different sites tested. 
In general, recombination frequencies on mutant substrates were distinctively 
the highest with R3M3. The double mutant E262G/D29A was about half as 

30 effective as R3M3, whereas the other double mutants and the single mutant 
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E262G showed slightly further decreased recombination frequencies on the 
altered sites. The wt enzyme did not recombine any of the mutant substrates 
presented here, neither in vivo nor in vitro. With previous results showing that 
single D29A, Dl 89N, and T3 16S mutants of Cre perform like the wt enzyme on 
5 loxK2 and loxKl in vivo and the fact that the E262G point-mutation was the only 
one found independently in all the originally isolated Cre mutants with loxK2 
specificity, it is clear that E262G is a critical mutation that allows Cre to 
recognize loxK2. However, in combination with E262G, D29A permits still 
higher recombination frequencies on lox sites altered at positions 1 1 and 12. 

10 Since R3M3 shows even further increased recombination frequencies on the 
altered sites compared to E262G/D29A, other of the point mutations identified 
in this Cre mutant (see Figure 14) must account for this increase in activity. 
Because of its location close to the amino-terminus of the J helix, the Q255R 
mutation of R3M3 could be expected to influence DNA contacts. Other 

1 5 mutations may influence protein folding or protein-protein interactions which 
could result in a higher flexibility within the Cre-/ox interface and thus allowing 
a better tolerance of alterations of the lox sequence. This hypothesis is also 
supported by the observation that R3M3 recognizes the loxKl site at frequencies 
similar to E262G recognizing loxK2. The double and single Cre mutants, on the 

20 other hand, did not show activity on loxKL In addition, the gel-shift 

experiments showed that R3M3-Cre's binding affinity for loxK2 and loxKl half- 
sites is less than E262G-Cre's and the three double mutant's. Taking these 
results together, other mutations of R3M3 must further influence Cre-/ox 
interactions to allow enhanced recombination on loxK2, On the one hand, this 

25 results in less efficient binding to lox half-sites, on the other hand, when 

complete lox sites are available, the cooperativity phenomenon between Cre 
enzymes binding to the same and different lox sits may compensate for this loss 
in binding activity. Then the postulated increased flexibility between DNA and 
protein seems to become advantageous for recognizing and recombining altered 

30 lox sites, as seen with the in vivo and in vitro recombination assays. 
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Alternatively, some mutations, also silent ones, could influence protein 
expression, leading to a faster accumulation of enzymes and consequently to 
higher recombination frequency. An E. coli codon usage table suggested, 
however, that none of the identified mutations should improve Cre expression 
5 in E. coli remarkably. 

Modeling 

With the published crystal structure of four Cre molecules bound to two 
synapsed loxA sites after the first single strand cleavage (Guo et ah, Nature, 
389:40-46 (1997)) the identified point mutations were analyzed for being 

1 0 involved in DNA and/or protein interactions. All of them, including the E262 
position in the J-helix of Cre were found to be not involved in either interactions 
in this state of the reaction. This observation indicates that the mutations which 
were found to account for the described novel substrate recognition in vivo and 
in vitro lead to this new phenotype in a less direct and obvious manner. As 

1 5 mentioned earlier, they may influence protein folding, resulting in a higher 
flexibility within the Cre-/ox interface. This hypothesis is especially well 
supported with the described differences between binding to lox halfsites and 
recombination. Alternatively, they may still be involved in protein-protein 
(D29A) or protein-DNA (E262G) contacts before or after the formation of the 

20 clamp-like strand-exchange state. 

In Vivo and In Vitro Recombination 

Some additional variations of the loxK2 site were tested also. The list in 
Table 17 shows all the sequences of lox sites which were tested and assigns 
them a name, as well as their plasmid (pBS) number. In Figure 15 the obtained 
25 in vivo recombination frequencies on all the variants of loxK2 are indicated. The 
additional resuhs indicate that alterations at position 12 of the lox halfsite are of 
more importance for Cre-based recombination than ones at position 11. 

The in vitro recombination frequencies of all six Cre enzymes tested on 
the lox sites listed above are given in table 15. The frequencies were calculated 
30 after quantitation of the brightness of fluorescence of the Ethidium-Bromide- 
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stained DNA fragments on agarose-gels. Differences in temperature, ionic 
strength, medium composition, and enzyme concentration probably account for 
the observed differences between in vivo and in vitro recombination results. 
Most strikingly, loxP is no longer recognized with the highest frequencies in 
5 vitro. However, when the ionic strength in the in vitro assay was increased 
results began to resemble the ones seen in vivo. Thus, efficiency of 
recombination with variant lox sites by each of the Cre mutant and the wt 
enzyme can be further controlled in vitro by adjusting ionic strength and other in 
vitro conditions. 

10 Qualitatively, however, the in vivo and in vitro recombination 

frequencies mirror each other. These novel Cre mutants thus possess a 
specificity for substrates (IoxK2 and its derivatives) which are not recognized by 
the wt enzyme, 
ill Gelshift Assays 

t 15 DNA binding (gelshift) experiments were also done with the generated 

□ Cre double-mutants (E262G with D29A, Dl 89N, or T3 1 6S) to analyzing 

;|i binding of the variant Cre recombinases to various recombination sites. In table 

:L 16 the observed mean percentages of binding to loxP, loxK2, and loxKl halfsites 

-2 with the five different Cre mutants and the wt enzyme are given. As shown, all 

ffi 20 mutants - in contrast to the wt enzyme - do bind to loxK2 with similar 

'z: frequencies, except R3M3 which shows surprisingly low retardation. As 

discussed above, this phenomenon may be explained with an increased 
tolerance, i.e., flexibility, of R3M3 for altered lox sites. On halfsites which 
precludes the cooperativity between Cre molecules in binding R3M3 cannot 
25 bind as tight as wt or the single and double mutants. With the loxKl halfsite this 
'binding versus recombination' difference is even more strikingly. Whereas wt 
and the R3M3 mutant cannot bind to the halfsite, all the other mutants can. Yet, 
recombination of loxKl substrates was seen with R3M3 Cre, only. These results 
show clearly that simple DNA affinity does not correlate in a one-to-one fashion 
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with recombination. Thus, inappropriate DNA binding by recombinases likely 
can lead to a block in recombination. 
Conclusions 

The in vivo data show that: 
5 1 . The E262G mutation confers a generally elevated level of recombination 
at a number of variant lox sites, those having any of a large number of 
alterations at positions 11 and 12 (and mirroring 11' and 12' alterations). 

2. The D 1 89N mutation in conjunction with the E262G mutation appears to 
fine tune the broadened specificity of the E262G mutation by reducing 

10 recombination at the loxKl variants 'GG', 'TC' and 'CC without decreasing 
recombination at loxK2 and the 'GT' variant. This mtitation is thus useful to 
limit the broadened specificity of E262G. 

3. The T316S mutation when in conjunction with E262G provides a slight 
boost in recombination frequency with the loxK2 variants 'GT' and 'CC', and 

1 5 has no deleterious effect on recombination at the other variant loxK2 sites. 

4. The D29A mutation together with E262G boosts recombination at loxK2 
and the variants 'CC' and 'GG'. D29A does not reduce recombination at the 
other variant lox sites or at loxP, 

5. Some of the additional mutations of R3M3 must account for the fiarther 
20 increased recombination fi^equency on any of the tested mutant lox sites, 

including loxKl but do not compromise loxP recognition. 

The disclosed variant recombinases have a number of useful features and 
applications. By recognizing an altered, user-defined target site, they were 
designed to allow both genetic targeting events in prokaryotes and eukaryotes 
25 like v^ Ore but on different sites and in vitro recombination strategies. With a 
wider variety of possible target sequences being now accessible, multiple and 
defined genomic alterations will now become feasible. This opens more 
possibilities in designing genomic manipulations in all DNA-based organisms 
by site-specific recombination. 
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With the various genome projects under way today, there will be more 
and more applications for site-specific recombination to study the impact of 
genes or genetic control elements by genomic engineering. In addition, genome 
manipulations are also more frequently used to express recombinant proteins 
within the organism of choice. 

Table 13* Presented are the results of the first four rounds of the 
described mutagenesis procedure for loxK2 specificity. Column one indicates 
the round of the mutagenesis procedure (round 0 indicates that no mutagenesis 
event has taken place yet, wt enzyme, only) and column two the allotted time for 
ere expression in induction medium. Column three presents the observed 
frequencies of Kan^, indicating loxK2 recombination by a mutant candidate. Of 
note, all Kan^ colonies found during this experiment were white, indicating 
effective loxP recombination. In column four the actual number of Kan^ 
colonies found in each round of 1000 to lO"^ plated is given, and the last column 
presents the frequency of white colonies on non-selection plates which is 
decreasing with every round. 

Table 14. Presented are the frequencies of Kan^ of six chosen candidate 
mutants from round three of the mutagenesis procedure (R3M1 to 6) when 
retesting them in the indicator strain. Also, they were tested for their 
performance on loxP sites in identical fashion. The obtained Kan^ frequencies 
indicate the percentages of recombination within the allotted induction time of 
2.5 hours. 

It is understood that the disclosed invention is not limited to the 
particular methodology, protocols, and reagents described as these may vary. It 
is also to be understood that the terminology used herein is for the purpose of 
describing particular embodiments only, and is not intended to limit the scope of 
the present invention which will be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the 
singular forms "a "an", and "the" include plural reference unless the context 
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clearly dictates otherwise. Thus, for example, reference to "a host cell" includes 
a plurality of such host cells, reference to "the antibody" is a reference to one or 
more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

5 Unless defined otherwise, all technical and scientific terms used herein 

have the same meanings as commonly understood by one of skill in the art to 
which the disclosed invention belongs. Although any methods and materials 
similar or equivalent to those described herein can be used in the practice or 
testing of the present invention, the preferred methods, devices, and materials 

10 are as described. Publications cited herein and the material for which they are 
cited are specifically incorporated by reference. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such 
disclosure by virtue of prior invention. 

Those skilled in the art will recognize, or be able to ascertain using no 

1 5 more than routine experimentation, many equivalents to the specific 

embodiments of the invention described herein. Such equivalents are intended 
to be encompassed by the following claims. 
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Table 2 



Table 2: List of Strains Used for Selection and Screen 



U O 7T 


L/ei>cupiion 


Substrates 


Selection and Screen 


1493 


BS583 [pBS583] 


/oxKlV/avP- 


neo selection and lacZ screen, respectively 


1494 


BS583 [pBS5841 


loxKl'/lox?' 


neo selection and lacZ screen, respectively 


1541 


BS583 [pBS6131 


loxP- 1 loxP- 


neo selection and lacZ screen, respectively 


1523 


NS2300 [pBS601] 


loxKV-/loxP- 


abf A.St screen and neo selection, respectively 


1524 


NS2300 [pBS602] 


loxKlVloxP- 


abfA.st screen andheo selection, respectively 



nciy.n-5 J T^o auu c^vi^utug jjiuccuuics lor me aesirea ere mutants. 

BS1493 and BS 1494 served as selection strains for KT or K2' Cre mutants, respectively. BS 1541 was used as a 
control strain to determine loxP activity. BSi523 and BS1524 were used to select for P' mutants. Construction 
and application of the different selection procedures are explained in detail in the text. 
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Tables 3 and 4 



Table 3: Sequencing Primers for ere 



Primer BSB# 


Sequence 5' - 3' 


Position (5% 3') in ere 


454 


TTT GGG CTA GCG AAT TCG AG 


-55, -36 


455 


TTT GGG CCA GCT AAA CAT GC 


273, 292 


456 


CGG TGG GAG AAT GTT AAT CC 


567, 586 


457 


GGA CAC AGT GCC CGT GTC ^ 


862, 879 


458 


TCT GCG TTC TGA TTT AAT CTG 


1117, 1097 


459 


CCA GGC CAG GTA TCT CTG 


858, 841 


460 


GTA CGT GAG ATA TCT TTA ACC C 


563, 542 


461 


TTG CTG GAT AGT TTT TAC TGC C 


270, 249 



Presented are the eight sequencing printers used for sequencing of the entire ere gene in both directions and their 
positions within the ere coding sequence. Primers BSB 454 to BSB457 allow sequencing of the coding strand, 
whereas the four remaining primers have been designed for the non-coding strand. In order to achieve a complete 
sequence information in both directions, about 350 bases have to be read out of each sequencing reaction. The 
positions given for each primer refer to its 5* and 3' end, with position 1 referring to the adenine of the ATG start 
codon of ere. 



Table 4: Primers Used in the QuickChange'^^ Site-Directed Mutagenesis Kit (Stratagene) 



Primer 

::-..';:v.i!;^f.V::..' 


: : ^ : : ^ ^- ■ : . ■ ' : - ^ i 






UCiAl'CAACiXJGCUCCCl'GUGAUUGA'll-l'ri'GAAGCAACl'CAlUG 


765, 809 


466 


GAGrrGCrTCAAAAATCCCTCCCAGGGCGCGAGTTGATAGCTGGC 


805, 761 




GCrATCAACTCGCGCCCTGGCAGGGATTTTTGAAGCAACTCATCG 


765, 809 




GAGTTGCTTCAAAAATCCCTGCCAGGGCGCGAGTTGATAGCTGGC 


, 805,761 


469 


GCTATCAACTCGCGCCNNNNNNNNNATmTGAAGCAACTCATCG 


765, 809 


470 


GAGTTGCTTCAAAAATNNNNNNNNNGGCGCGAGTTGATAGCTGGC 


805, 761 



Illustrated are the mutant oligonucleotides used in the QuickChange*^^ Site-Directed Mutagenesis Kit (Su^atagene) 
to introduce single point mutations (mismatches with the wt sequence are highlighted by bold letters). With 
BSB465/466 a E262G mutation, and with BSB467/468 a E262A mutation is introduced in Cre, The last set of 
primers (BSB469/470), as indicated by the bold N, represent an equimolar mixture of all possible bases at the 
assigned positions. This mixture results in all possible aa combinations at positions 261 to 263 of Cre. As 
mentioned before, the indicated positions refer to 5' and 3' end of the oligonucleotides in the ere coding sequence. 
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Table 5 



Table 5: Comparison of Mutagenic and Non-Mutagenic PGR on wt ere 









i'iiiiiiiiip 






> 99.6% 


< 0.4% 


> 99.5% 


< 0.5% 




19.7% 


81.3% 


60.8% 


39.2% 



Hie experiments presented here served two purposes: (i) to compare the influence of glucose (SOC medium) and 
arabinose on ere expression with pBAD33, {it) to compare the frequencies of deleterious point mutations 
occurring during non-mutagenic and error-prone PGR. The first column indicates the medium used for repression 
(SOC), or induction of cr^ expression (SOB+ara). The second column compares the observed frequencies of blue 
and white colonies after a non-mutagenic PCR, the third column after a mutagenic PGR. Blue colonies indicate, 
that the /ojcP^ flanked lacZ gene of BS583 is not excised by Gre, white ones its Cre-mediated excision. Note that 
the number of blue colonies resulting from empty pBAD33 vector (approximately i% in all experiments) was 
subtracted before calculating the presented values. This blue background was determined with a control ligation 
lacking ere insertion. With glucose in the medium, only blue colonies are found, indicating that ere is not 
expressed. With arabinose induction, however, the frequencies of disabled Gre mutants (blue) can be determined, 
increasing three times under mutagenic conditions. 



99735Svl 



100 



OMRF178 
204S7/247 



Table 6 



Table 6: The First Four Rounds on loxKl' 



Round 


Status 


Induction 


% White 


% Kan^ 


Nb. Kan*^ 


Nb. Pooled for 








(Non-Seiection) 


(Wliite) 


(White) 


the Next Round 


1 


ox 


4 h 


83.9% 


0.01% 


I 


none 




mx 


4 h 


42.4% 


0.16% 


6 


6 


2 


mxox 


2.5 h 


88.3% 


0,02% 


i 


none 






4 h 


82.5% 


0-56% 


47 


47 




mxmx 


2-5 h 


13% 


<0.01% 


0 


none 






4 h 


24% 


0,3% 


5 


none 


3 


mxoxox 


2.5 h 


80,7% 


0.2% 


36 


30 






4 h 


85% 


25% 


10' 


none 




mxoxmx 


2.5 h 


25.7% 


0,02% 


1 


none 






4 h 


21.8% 


5% 


250 


none 


4 


mxoxoxox 


2,5 h 


75% 


4,6% 


102 


none 




mxoxoxmx 


2.5 h 


< 7.7% 


0,3% 


3 


none 



Presented are the results of the first four rounds of the described mutagenesis procedure for loxK2 specificity. The 
following symbols for the status of the ere pools have been used: "o" indicates a non-mutagenic, "m" a 
mutagenic PCR, and "x'' stands for the shuffling step by DNase I digest and reassembly. Column one indicates 
the round of the mutagenesis procedure, column two its status as mentioned above, and column three the time 
permitted for ere expression in induction medium. In column four, the calculated percentages of white colonies on 
non-selection plates is indicated, decreasing with every round, especially when the pools were subjected to 
mutagenic PCR*s twice. Column five presents the observed frequencies of Kan*^, indicating loxKl recombination 
by a mutant candidate. Of note, that all Kan^ colonies found during this experiment were white, indicating 
effective lox? recombination. The two last columns give the actual numbers of Kan*^ colonies found in each 
round (1/5 dilution) and mark the ones used as starting material for the next round. 
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Table 7 



Table 7: Functional Evaluation of Six Selected mxoxox K2* Mutants and wt Cre 



Cre 

fi n Hirl 51 tp 


inQUCtion 
Time 

JL 1. 1.1.1. V 




^ TTiin*^ with 
/C 'Xvdii Wit 11 


RS1493 c/rtrtfi^V 


Wt 


0 ^ h 


U.VJUZ /o 


yD.y. fO 


u.UUl /O 


4 h 


n.d. 


n,d. - 


0 002% 


mxoxox 1 


2.5 h 


66.4% 


98.2% 


0.004% 


4h 


n.d. 


n.d. 


0.005% 


mxoxox 2 


2.5 h 


53.6% 


98% 


0.007% 


4h 


n.d. 


n.d. 


0.005% 


mxoxox 3 


2.5 h 


66.5% 


80.5% 


0.007% 


4h 


n.d. 


n.d. 


0.01% 




2.5 h 


0.05% 


80.3% 


0.008% 


4h 


n.d. 


n.d. 


0.006% 




2.5 h 


15.2% 


81.2% 


0.006% 


4h 


n.d. 


n.d. 


0.009% 


Vj^mxoxox 6 • 


2.5 h 


3.1% 


83.7% 


0.003% 


4h 


n.d. 


n.d. 


0.01% 



The results of functional testing of the six chosen ¥X!?^ Cre can<Udates compared to wt Cre are listed. All 
enzymes were checked on /oxK2, loxV and loxYA recombination in the selection strains BSI494» BS1493 and 
the control strain BS 1541. In the first column the Cre candidate is indicated, in the second the allowed time for 
cre expression. The three following columns show the calculated percentages of Kan^ when subjecting the 
different Cre mutants and wt Cre to the three selection strains. The frequency of the observed Kan^ phenotype is 
considered as an indicator for the frequency of lox recombination: On /ojcK2, all mutants, except mxoxox 4, 
show remarkably increased Kan^ frequencies compared to wt Cre. On /oxP, all (including mxoxox 4), show very 
similar results as wt Cre, indicating, that lox^ recombination is at best slightly affected. On loxYA neither 
mutant nor wt Cre show more than background activity, even after four hours of cre expression. The obtained 
mutants are therefore characterized by a broader substrate recognition compared to the wt enzyme. 
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Table 8 



Table 8: Sequence Analysis of the Six mxoxox K27P* Mutants 





Cre 


Codon 


Change 


in cre 


aa Change 


Nb. of 


Position in 




Candidate 


Posilion 


wt — 


— ► mut 




Isolates 


Cre 




mxoxox 1 


45 


CCG 


cci 


(Pi5P) 


2 


(N-ierminal) 






86 


GAC 


GCC 


D29A 


2 


A 






565 


GAC 


AAC 


D189N 


2 


loop 1 - 2 






642 


AGC 


AGI 


(S214S}" 


1 


(N-lerm. of I) 






679 


GTC 


ATC 


V227I 


1 


I 






785 


GAA 


GGA 


E262G* 


5 


J 




mxoxox 2 


45 


CCG 


CCI 


(P15P) 


2 


(N-terminal) 






4!3 


GAA 


GGA 


E138G 


i 


N-term. of F 






760 


AGC 


GGC 


S254G 


1 


loop 5 - J 






785 


GAA 


GGA 


E262G* 


5 


J 






861 


TCI 


TCC 


{S287S*) 


2 


N-term. of K 






946 


ACC 


ICC 


T316S 


3 


loop L - M 




mxoxox 3 


46 


GTC 


ATC 


V16I 


1 


N-term inal 






565 


GAC 


AAC 


D189N 


2 


loop I - 2 






592 


GGC 


AGC 


G198S 


1 


C-term, of 2 






668 


CGA 


CAA 


R223Q 


i 


I 


£ 




764 


GAG 


CGG 


Q255R 


1 


loop 5 - J 






785 


GAA 


GGA 


E262G* 


5 


J 






861 


TCI 


TCC 


(S287S*) 


2 


N-term. of K 






920 


CCG 


CIG 


P307L 


i 


L 




mxoxox 4 


230 


TAT 


TGT 


Y77C 


1 


C 






412 


GAA 


AAA 


E138K 


1 


N-term, of F 






851 


GIG 


GAG 


L284Q 


2 


loop J - K 




mxoxox 5 


86 


GAC 


GCC 


D29A 


2 


A 






302 


CGG 


GAG 


RiOiQ* 


1 


D 






659 


CIG 


GAG 


L220Q 


1 


I 






785 


GAA 


GGA 


E262G* 


5 


J 






946 


ACC 


ICC 


T316S 


3 


loop L - M 




mxoxox 6 


785 


GAA 


GGA 


E262G* 


5 


J 






851 


CIG 


CAG 


L284Q 


2 


loop J - K 






946 


ACC 


ICC 


T316S 


3 


loop L - M 



The point mutations identified by sequence analysis in the six selected K2VP"' mutants are listed. Column one 
identifies the mutants, column two the observed point mutations and their position in the cre coding sequence. 
Column three indicates the resulting aa changes and their positions in the Cre enzyme (silent mutations in 
parenthesis). DNA contacting residues, according to the crystal structure (Guo, eL ciL, 1997), are marked by an 
asterisk. Column four indicates how often the different point mutations were independently found in the pool of 
the six mutants, and column five shows where the aa changes are located in the secondary structure of the protein. 
Of the altogether 31 point mutations listed, live are silent. Only one aa change, E262G, is common to all 
mutants with remarkably increased /avK2 activity (n\xoxox 1, 2, 3, 5, and 6), suggesting that this mutation is 
the essential one for the observed phenotype. Nine point mutations occuired twice or three limes in the pool, 
whereas eleven were found in one mutant only. For furliicr information, see text and appendices A, B, and C. 
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Table 9 



Table 9: Functional Evaluation of Three Defined Cre Mutants for Position 262 and wt Cre 



Cre Mutant 


Induction 
Time 


BS1494 (/oa;K2^) 


% Kan" with 
BS1541 (/ojcP^) 


BS1493 (/oxKl^) 


wt 


2.5 h 


0.002% 


93.9% 


0.001% 


E262G 


2.5 h 


2.2% 


84.2% 


0.004% 


E262A 


2.5 h 


0.4% 


82.3% 


0.006% 


E262W 


2.5 h 


2.9% 


' 81.7% 


0.003% 



enzymes were checked on /ojcK2, lox^ and loxYA recombination in the selection strains BS1494, BS1493 and 
the control strain BS 1 541. In the first column the Cre candidate is indicated, in the second the allowed time for 
cre expression. The three following columns show the calculated percentages of Kan^ when subjecting the 
different Cre mutants and wt Cre to the three selection strains. The frequency of the observed Kan^ phenotype is 
considered as an indicator for the frequency of lo^ recombination: On /o;:K2, E262G and E262W show 
approximately an lO"^ fold increased Kan*^ frequency compared to wt Cre, whereas E262A results in a 200 fold 
increase. On /ojcP, ail show very similar results as wt Cre, indicating, that iox^ recombination is at best slightly 
affected. On loxYA neither mutant nor wt Cre show more than background activity. This indicates, that E262G 
and E262W alone are sufficient to remarkably increase Ore's activity on /oxK2, that these mutations do not 
increase ioxYA recognition, and that lox? activity may be slightly affected. 
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In Vitro Evolution of the Cre recombinase 



wtCre 

mxoxox { 

mxoxox2 

mxoxox3 

mxoxox4 

mxoxox5 

mxoxox6 

-55 



TTTGGGCTAG 
TTTGGGCTAG 
TTTGGGCTAG 
TTTGGGCTAG 
TTTGGGCTAG 
TTTGGGCTAG 
TTTGGGCTAG 



CGAATTCGAG 
CGAATTCGAG 
CGAATTCGAG 
CGAATTCGAG 
CGAATTCGAG 
CGAATTCGAG 
CGAATTCGAG 



CTCGGTACCC 
CTCGGTACCC 
CTCGGTACCC 
CTCGGTACCC 
CTCGGTACCC 
CTCGGTACCC 
CTCGGTACCC 



GGGGATCCTC 
GGGGATCCTC 
GGGGATCCTC 
GGGGATCCTC 
GGGGATCCTC 
GGGGATCCTC 
GGGGATCCTC 



TAGACTCAGT 
TAGACTGAGT 
TAGACTGAGT 
TAGACTGAGT 
TAGACTGAGT 
TAGACTCAGT 
TAGACTGAGT 



wtCre 

mxoxoxl 

mxoxox2 

mxoxox3 

mxoxox.4- 

mxoxoxS 

tnxoxox6 

-5 ; 



GTGAA^^xyrc 

GTGAAaTjTC 
GTGAA>"GTC 
GTGAA:^_73TC 
GTGAAr.TOTC 
GTGAAaTOTC 
GTGAAATC-TC 



caatttactg 
caatttactg 
caatttactg 
caatttactg 

CAATTTACTG 
CAATITACTG 
CAATTTACTG 



ACCGTACACC 
ACCGTACACC 
ACCGTACACC 
ACCGTACACC 
ACCGTACACC 
ACCGTACACC 
ACCGTACACC 



AAAATTTGCC 
AAAATTTGCC 
AAAATTTGCC 
AAAATTTGO: 
AAAATTTGCC 
AAAATTTGCC 
AAAATTTGCC 



TGCATTACCG 
TGCATTACCT 
TGCATTACCT 
TGCATTACCG 
TGCATTACCG 
TGCATTACCG 
TGCATTACCG 



WtCre ::. GTCGATGCAA 
;;;mxax^^^ GTCGATGCAA 
GTCGATGCAA 
lirixoxSxKi ATCGATGCAA 
?;^iraoxbx^ GTCGATGCAA 
GTCGATGCAA 
GTCXJATGCAA 



CGAGTGATGA 
CGAGTGATGA 
CGAGTGATGA 
OjAGTGATGA 
CGAGTGATGA 
CGAGTGATGA 
CGAGTGATGA 



GGTrCXX:AAG 
GGTTCGCAAG 
GGTTGGCAAG 
GGTTCGCAAG 
GGTTGGCAAG 
GGTTCGCAAG 
GGTTGGCAAG 



AACCTGATGG 
AAGCTGATGG 
AACCTGATGG 
AACXTGATGG 
AACCTGATGG 
AACCTGATGG 
AACCTGATGG 



ACATGTTCAG 
CCATGTTCAG 
ACATGTTCAG 
ACATGTTCAG 
ACATGTTCAG 
CCATGTTCAG 
ACATGTTCAG 




lirixoxoxS^^^ 



GGATCGCCAG 
GGATCGCCAG 
GGATCGCCAG 
GGATCGCCAG 
GGATCGCCAG 
GGATCGCCAG 
GGATCGCCAG 



GCGTHTCTG 
GOGTTTTCTG 
GCGTmCTG 
GCGTnrCTG 
GCXnTTTCTG 
GCGTTTTCTG 
GCGTmCTG 



AGCATAOCTG 
AGCATAOCTG 
AGCATACCTG 
AGCATACCTG 
AGCATACCTG 
AGCATACCTG 
AGCATACCTG 



GAAAATGCTT 
GAAAATGCTT 
GAAAATGCTT 
GAAAATGCTT 
GAAAATGCTT 
GAAAATGCTT 
GAAAATGCTT 



CrGTCCGTTT 
CTGTCCGTTT 
CTGTCCGTTT 
CTCjrCCGTTr 
CTGTCCGTTT 
CTGTCCGTTT 
CTGTCCGTTT 





GCCGGTCGTG 


GGCGGCATGG 


TGCAAGTTGA 


ATAACCGGAA 


ATGGTTTCCC 


■mxoxoxly 


GCCGGTCGTG 


GGCGGCATGG 


TGCAAGTTGA 


ATAACCGGAA 


ATGGTTTCCC 


';rnxoxox2 • 


GCCGGTCGTG 


GGCGGCATGG 


TGCAAGTTGA 


ATAACCGGAA 


ATGGTTTCCC 


mxpxoxS : 


GOZGGTCGTG 


GGCGGCATGG 


TGCAAGTTGA 


ATAACCGGAA 


ATGGTTTCCC 


•mxoxoxl 


GCCGGTCGTG 


GGCGGCATGG 


TGCAAGTTGA 


ATAACCGGAA 


ATGGTTTCCC 


:;mxoxoxi5c:. 


GCCGGTCGTG 


GGCGGCATGG 


TGCAAGTTGA 


ATAACCGGAA 


ATGGTTTCCC 


lnixoXQX<5;:^ 


GCCGGTCGTG 


GGCGGCATGG 


TGCAAGTTGA 


ATAACCGGAA 


ATGGTTTCCC 


WtCre • • : 


GCAGAACCTG 


AAGATGTTCG 


CGATTATCTT 


CTATATCrrC 


AGGCGCGCGG 


...mxoxoxl 


GCAGAACCTG 


AAGATGTTCG 


CGATTATCFT 


CTATATCTTC 


AGGCGCGCGG 


:mxoxox2 


GCAGAACCTG 


AAGATGTTCG 


CGATFATCTT 


CTATATCTTC 


AGGCGCGCGG 


mxoxox3 


GCAGAACCTG 


AAGATGTTCG 


CGATTATCTT 


CTATATCTTC 


AGGCGCGCGG 


mxoxox4 


GCAGAACCTG 


AAGATGTTCG 


CGATTATCTT 


CTATGTCTTC 


AGGCGCGCGG 


mxoxox5 


GCAGAACCTG 


AAGATGTTCG 


CGATTATCTT 


CTATATCTTC 


AGGCGCGCGG 


mxoxox6 


GCAGAACXTG 


AAGATGTTCG 


CXiATTATCTT 


CTATATCTTC 


AGGCGCGCGG 


196 
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fn Vitro Evolution nf the Cre recombmase 



wtCre 

mxoxoxl 

mxoxox2 

mxoxox3 

mxoxox4 

mxoxoxS 

mxoxox6 

246 



TCTGGCAGTA 
TCTGGCAGTA 
TCTX3GCAGTA 
TCTXXICAGTA 
TCTGGCAGTA 
TCTGGCAGTA 
TCTGGCAGTA 



AAAACTATCC 
AAAACTATCC 
AAAACTATCC 
AAAACTATCC 
AAAACTATCC 
AAAACTATCC 
AAAACTATCC 



AGCAACATTT 
AGCAACATTT 
AGCAACATTT 
AGCAACATTT 
AGCAACATTT 
AGCAACATTT 
AGCAACATTT 



GGGCCAGCTA 
GGGCCAGCTA 
GGGCCAGCTA 
GGGCCAGCTA 
GGGCCAGCTA 
GGGCCAGCTA 
GGGCCAGCTA 



AACATGCTTC 
AACATGCTTC 
AACATGCTTC 
AACATGCTTC 
AACATGCTTC 
AACATGCTTC 
AACATGCTTC 



wtCre ATCGTCGCTC 

mxoxoxl ATCGTCGGTC 

mxoxo^2 ATCGTCGGTC 

mxoxoxS ATCGTCGCTC 

,mxoxox4 ATCGTCGGTC 
;>iiixoxpx5 ^ ATCGTCAGTC 

^ mxaxGX6 ATCGTCGGTC 

WW 

Mt<M:&M GTTATGCGGC 
lll^llo^li GTTATGCGGC 
m00Ms GTTATGCGGC 
liM^lil GTTATGCGGC 
^l^ii GTTVVD3CGGC 
l^W GTDVrGCGGC 
. GTTATGCGGC 

ACAGGCTCTA 
ACAGGCTCTA 
ACAGGCTCTA 
ACAGGCTCTA 
ig. ACAGGCTCTA 
ACAGGCTCTA 
'WMmm ACAGGCTCTA 




CGGGCTGCCA 
CGGGCTGCCA 
CGGGCTGCCA 
CGGGCTGCCA 
CGGGCTGCCA 
CGGGCTGCCA 
CGGGCTGCCA 



GGATCCGAAA 
GGATCCGAAA 
GGATCCGAAA 
GGATCCGAAA 
GGATCCGAAA 
GGATCXXjAAA 
GGATCCGAAA 



GCGTTCGAAC 
GCGTTCGAAC 
GCGTTCGGAC 
GCGTTCGAAC 
GCGTTCAAAC 
GCGTTCGAAC 
GCGTTCGAAC 



CGACCAAGTG 
CGACCAAGTG 
CGACCAAGTG 
ODACCAAGTG 
CGACCAAGTG 
CGACCAAGTG 
CGACCAAGTG 



AGAAAACGTT 
AGAAAACGTT 
AGAAAACGTT 
AGAAAACGTT 
AGAAAACGTT 
AGAAAACGTT 
AGAAAACGTT 



GCACTGATTT 
GCACTGATTT 
GCACTGATTT 
GCACTGATTT 
GCACTGATTT 
GCACTGATTT 
GCACTGATTT 



ACAGCAATGC 
ACAGCAATGC 
ACAGCAATGC 
ACAGCAATGC 
ACAGCAATGC 
ACAGCAATGC 
ACAGCAATGC 



GATGCCGGTG 
GATGCOGGTG 
GATGCCGGTG 
GATGCCGGTG 
GATGCCGGTG 
GATGCCGGTG 
GATGCCGGTG 



OaACCAGGTT 
CGAOCAGGTT 
CGACCAGGTT 
CGACCAGGTT 
CGACCAGGTT 
CXjACCAGGTT 
CGACCAGGTT 



TGTTTCACTG 
TGTTTCACTG 
TGTTTCACTG 
TGTTTCACTG 
TGTTTCACTG 
TGTTTCACTG 
TGTTTCACTG 



AACGTGCAAA 
AACGTGCAAA 
AACGTGCAAA 
AACGTGCAAA 
AACGTGCAAA 
AACXJTGCAAA 
AACGTGCAAA 



CGTTCACTCA 
CGTTCACTCA 
CGTTCACTCA 
CGTTCACTCA 
CGTTCACTCA 
CGTTCACrCA 
CGTTCACTCA 



:-li5xSx8x2 :" ■ 
mxidxoxS 

,..^j[it^<^px4:.-::i; 
:'::;tBMx^bx5;;^^ 
:mxoxdx6?:^ 



TGGAAAATAG 
TGGAAAATAG 
TGGAAAATAG 
TGGAAAATAG 
TGGAAAATAG 
TGGAAAATAG 
TGGAAAATAG 



CGATCGCTGC 
CGATCGCTGC 
CGATCGCTGC 
CGATCGCTGC 
CGATCGCTGC 
CGATCGCTGC 
CGATCGCTGC 



CAGGATATAC 
CAGGATATAC 
CAGGATATAC 
CAGGATATAC 
CAGGATATAC 
CAGGATATAC 
CAGGATATAC 



CTAATCTGGC 
CTAATCTGGC 
CTAATCTGGC 
CTAATCTGGC 
CTAATCTGGC 
CTAATCTGGC 
CTAATCTGGC 



ATTTCTGGGG 
ATTTCTGGGG 
ATTTCrGGGG 
ATTTCTGGGG 
ATTTCTGGGG 
ATTTCTGGGG 
ATTTCTGGGG 



"w}£^yW-- ATTGCTTATA 

^mxc)x<)xl ATTGCTTATA 

SxSBx2; - ATTGCTTATA 

mxoxQxS ATTGCTTATA 

mxoxoxl ATTGCTTATA 

mxoxdxS ATTGCTTATA 

mxoxox6 ATTGCTTATA 

496 



ACACCCTGTT 
ACACCCTGTT 
ACACCCTGTT 
ACACCCTGTT 
ACACCCTGTT 
ACACCCTGTT 
ACACCCTGTT 



ACXTTATAGCC 
ACCTATAGCC 
ACCTATAGCC 
ACCTATAGCC 
ACCTATAGCC 
ACCTATAGCC 
ACCTATAGCC 



GAAATTGCCA 
GAAATTGCCA 
GAAATTGCCA 
GAAATTGCCA 
GAAATTGCCA 
GAAATTGCCA 
GAAATTGCCA 



GGATCAGGCT 
GGATCAGGCT 
GGATCAGGCT 
GGATCAGGCT 
GGATCAGGCT 
GGATCAGGCT 
GGATCAGGCT 
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In Vitro Evoiutton of the Cre recombmase 



wtCre 




TTArOTACTG 


ACGGTGGGAG 


mxoxoxl 


TAAAGATATC 


TCACGTACTA 


ACGGTGGGAG 


mxoxox2 


TAAAGATATC 


TCACGTACTG 


ACGGTGGGAG 


mxoxox3 


TAAAGATATC 


TCACGTACTA 


ACGGTGGGAG 


mxoxox4 


TAAAGATATC 


TCACGTACTG 


ACGGTGGGAG 


mxoxoxS 


TAAAGATATC 


TCACGTACTG 


ACGGTGGGAG 


mxoxox6 


TAAAGATATC 


TCACGTACTG 


ACGGTGGGAG 


546 









AATGTTAATC 
AATOFTAATC 
AATGTTAATC 
AATGTTAATC 
AATGTTAATC 
AATGTTAATC 
AATGTTAATC 



CATATTGGCA 
CATATTGGCA 
CATATTGGCA 
CATATTAGCA 
CATATTGGCA 
CATATTGGCA 
CATATTGGCA 



wtCre 

mxoxoxi 

mxGX0x2 

mx6xox3 

mxpxox4 

in X 0X0x5 

mxoxpxG: 



GAACGAAAAC 
GAACGAAAAC 
GAACGAAAAC 
GAACGAAAAC 
GAACGAAAAC 
GAACGAAAAC 
GAACGAAAAC 



GCTGGTTAGC 
GCTGGTTAGC 
GCTGGTTAGC 
GCTGGTTAGC 
GCTGGTTAGC 
GCTGGTTAGC 
GCTGGTTAGC 



ACCGCAGGTG 
ACCGCAGGTG 
ACCGCAGGTG 
ACCGCAGGTG 
ACCGCAGGTG 
ACCGCAGGTG 
ACCGCAGGTG 



TAGAGAAGGC 
TAGAGAAGGC 
TAGAGAAGGC 
TAGAGAAGGC 
TAGAGAAGGC 
TAGAGAAGGC 
TAGAGAAGGC 



ACTTAGCCTG 
ACTTAGTCTG 
ACTTAGCCTG 
ACTTAGCCTG 
ACTTAGCCTG 
ACTTAGCCTG 
ACTTAGCCTG 



^|ft]iX&xoX:l:;,l-. 



GGGGTAACTA 
GGGGTAACTA 
GGGGTAACTA 
GGGGTAACTA 
GGGGTAACTA 
GGGGTAACTA 
GGGGTAACTA 



AACTGGTCGA 
AACTGGTCGA 
AACTGGTCGA 
AACTGGTCGA 
AACTGGTCGA 
AACAGGTCGA 
AACTGGTCGA 



GCGATGGATT 
GCGATGGATT 
GOjATGGATT 
GCAATGGATT 
GCGATGGATT 
GCGATGGATT 
GCGATGGATT 



TCCGTCTCTG 
TCCATCTCTG 
TCCGTCTCTG 
TCCGTCTCTG 
TCCGTCTCTG 
TCCGTCTCTG 
TCXXJTCTCTG 



GTGTAGCTGA 
GTGTAGCTGA 
GTGTAGCTGA 
GTGTAGCTGA 
GTGTAGCTGA 
GTGTAGCTGA 
GTGTAGCTGA 




TGATCXXjAAT 
TGATCCGAAT 
TGATCCGAAT 
TGATCCGAAT 
TGATCCGAAT 
TGATCCGAAT 
TGATXXXjAAT 



AACrA<XTGT 
AACTACCTGT 
AACTACCTGT 
AACTACCTGT 
AACTAOJTGT 
AACTACCTGT 
AACTACCTGT 



TTTGCOGGGT 
TTTGOCGGGT 
TTTGCCGGGT 
TTTGCCGGGT 
TTTGCCGGGT 
TTTGCCGGGT 
TTTGCCGGGT 



CAGAAAAAAT 
CAGAAAAAAT 
CAGAAAAAAT 
CAGAAAAAAT 
CAGAAAAAAT 
CAGAAAAAAT 
CAGAAAAAAT 



GGTGTTGOCG 
GGTGTTGOCG 
GGTGTTGGCG 
GGTGrTTGCCG 
GGTGTTGCOG 
GGTGTTGCGG 
GGTGTTGCCG 



|^^QX.SpX^^^ 
?:lrtXQ XOx3 - ; 

:inxoxQx4 
; mxoxox5 : ; 
irixaxox^ 




CGCCATCTGC 
CGCCATCTGC 
CGCCATCTGC 
CGCCATCTGC 
CGCCATCTGC 
CGCCATCTGC 
CGCCATCTGC 



CACCAGCCAG 
CACCAGCCAG 
CACCGGCCAG 
CACCAGCCGG 
CACCAGCCAG 
CACCAGCCAG 
CACCAGCCAG 



CTATCAACTC 
CTATCAACTC 
CTATCAACTC 
CTATCAACTC 
CTATCAACTC 
CTATCAACTC 
CTATCAACTC 



GCGCCCTGGA 
GCGCCCTGGG 
GCGCCCTGGG 
GCGCCCTGGG 
GCGCCCTGGA 
GCGCCCTGGG 
GCGCCCTGGG 




AGGGATTTT 
AGGGATTTT 
AGGGATTTT 
AGGGATTTT 
AGGGATTTT 
AGGGATTTT 
AGGGATTTT 



WtCre; ;:; GAAGCAACTC ATCGATTGAT 

mxpxoxl: GAAGCAACTC ATCGATTGAT 

imxoxoxl • GAAGCAACTC ATCGATTGAT 

i mxoxoxi GAAGCAACTC ATCGATTGAT 

mxoxpx4>; GAAGCAACTC ATCGATTGAT 

mxoxox5 GAAGCAACTC ATCGATTGAT 

mxoxox6 GAAGCAACTC ATCGATTGAT 

796 



TTACGGCGCT AAGGATGACT CTGGTCAGAG 

TTACGGCGCT AAGGATGACT CTGGTCAGAG 

TTACGGCGCT AAGGATGACT CTGGTCAGAG 

TTACGGCGCT AAGGATGACT CTGGTCAGAG 

TTACGGCGCT AAGGATGACT CTGGTCAGAG 

TTACGGCGCT AAGGATGACT CTGGTCAGAG 

TTACGGCGCT AAGGATGACT CTGGTCAGAG 
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Title (csA-^ 



Jn Vitro Evolution of the Cre recombinase 



wtCrc 




TGGTCTGGAC 


ACAGTGCCCG 


tnxoxox I 


ATACCTGGCC 


TGGTCTGGAC 


ACAGTGCCCG 


mxoxox2 


ATACCTGGCC 


TGGTCCGGAC 


ACAGTGCCCG 


mxoxox3 


ATACCTGGCC 


TGGTCCGGAC 


ACAGTGCCCG 


mxoxox4 


ATACCAGGCC 


TGGTCTGGAC 


ACAGTGCCCG 


mxoxox5 


ATACCTGGCC 


TGGTCTGGAC 


ACAGTGCCCG 


mxoxox6 


ATACCAGGCC 


TGGTCTGGAC 


ACAGTGCCCG 


846 









TGTCGGAGCC 
TGTCGGAGCC 
TGTCGGAGCC 
TGTCGGAGCC 
TGTCGGAGCC 
TGTCGGAGCC 
TGTCGGAGCC 



GCGCGAGATA 
GCGCGAGATA 
GCGCQAGATA 
GCGCGAGATA 
GCGCGAGATA 
GCGCGAGATA 
GCGCGAGATA 



wtCre 

mxoxoxl 

mxoxox2 

mxoxox3 

mxoxox4 

mxQxox5 

mxoxQx6 

896 



TGGCCCGCGC 
TGGCCCGCGC 
TGGCCCGCGC 
TGGCCCGCGC 
TGGCCCGCGC 
TGGCCCGCGC 
TGGCCCGCGC 



TGGAGTTTCA 
TGGAGTTTCA 
TGGAGTTTCA 
TGGAGTTTCA 
TGGAGTTTCA 
TGGAGTTTCA 
TGGAGTTTCA 



ATACCGGAGA 
ATACCGGAGA 
ATACCGGAGA 
ATACTGGAGA 
ATACXXXjAGA 
ATACCGGAGA 
ATACCGGAGA 



TCATGCAAGC 
TCATGCAAGC 
TCATGCAAGC 
TCATGCAAGC 
TCATGCAAGC 
TCATGCAAGC 
TCATGCAAGC 



TGGTGGCTGG 
TGGTGGCTGG 
TGGTGGCTGG 
TGGTGGCTGG 
TGGTGGCTGG 
TGGTGGCTGG 
TGGTGGCTGG 



xiwtCre 



fSnixBliMft 



AGCAATGTAA 
ACCAATGTAA 
TCCAATGTAA 
ACCAATGTAA 
ACCAATGTAA 
TCCAATGTAA 
TCCAATGTAA 




mm^m aggggcamg 

^l^g^ aggggcaatg 
^^^J aggggcaatg 

tjnf Bi^lp aggggcaatg 
^^^m^^Mk AGGGGCAATG 



atattgtcat 

ATATTGTCAT 
ATATTGTCAT 
ATATTGTCAT 
ATATTGTCAT 
ATATTGTCAT 
ATATTGTCAT 



GTGCGCCTGC 
GTGCGOTTGC 
GTGCGCCTGC 
GTGCGCCTGC 
GTGCGCCTGC 
GTGCGCCTGC 
GTGCGCCTGC 



GAACTATATC 
GAACTATATC 
GAACTATATC 
GAACTATATC 
GAACTATATC 
GAACTATATC 
GAACTATATC 



TGGAAGATGG 
TGGAAGATGG 
TGGAAGATGG 
TGGAAGATGG 
TGGAAGATGG 
TGGAAGATGG 
TGGAAGATGG 



CGTAACCTGG 
CGTAACCTGG 
CGTAACCTGG 
CGTAACCTGG 
CXjTAACCTGG 
CGTAACCTGG 
CGTAACCTGG 



CGATTAGCCA 
CGATTACjCCA 

cgattaocca 
cgattagcca 
cgattaocca 
cgattagcca 
cgattaocca 

1032 



ATAGTGAAAC 
ATAGTGAAAC 
ATAGTGAAAC 
ATAGTGAAAC 
ATAGTGAAAC 
ATAGTGAAAC 
ATAGTGAAAC 



TTAACGCGTA 
TTAACGCGTA 
TTAACGCGTA 
TTAACGCGTA 
TTAACGCGTA 
TTAACGCGTA 
TTAACGCGTA 



-;vA<MMl^€ AATGATAAGC 

M5>cH|^ AATGATAAGC 

^::mHx;6x|^;?;- AATGATAAGC 

nixoxoxa AATGATAAGC 

mxox6x4 ; : AATGATAAGC 

:;;mx(ix'oX57;;> AATGATAAGC 

• mxoxQx6:;i AATGATAAGC 



TTGGCTGTTT 
TTGGCTGTTT 
TTGGCTGTTT 
TTGGCTGTTT 
TTGGCTGmT 
TTGGCTGTTT 
TTGGCTGTTT 



TGGCGGATGA 
TGGCGGATGA 
TGGCGGATGA 
TGGCGGATGA 
TGGCGGATGA 
TGGCGGATGA 
TGGCGGATGA 



GAGAAGATTT 
GAGAAGATTT 
GAGAAGATTT 
GAGAAGATTT 
GAGAAGATTT 
GAGAAGATTT 
GAGAAGATTT 



TCAGCCTGAT 
TCAGCCTGAT 
TCAGCCTGAT 
TCAGCCTGAT 
TCAGCCTGAT 
TCAGCCTGAT 
TCAGCCTGAT 



wtCrq S 


ACAGATTAAA 


TCAGAACGCA 


GA 




rnxoxoxl 


ACAGATTAAA 


TCAGAACGCA 


GA 


A T O - Start Codoa 


mxoxox2 


ACAGATTAAA 


TCAGAACGCA 


GA 




tnx 0X0x3 


ACAGATTAAA 


TCAGAACGCA 


GA 


'^-^^3 _ Stop Codon 


mxoxox4 


ACAGATTAAA 


TCAGAACGCA 


GA 




nixoxox5 


ACAGATTAAA 


TCAGAACGCA 


GA 


T - Point Mutation 


mxoxox6 


ACAGATTAAA 


TCAGAACGCA 


GA 




1096 
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In Vitro Evolution of the Cre recombina.se 



wtCre 


MSNLLTVHQN 


LPALPVDATS 


DEVRKNLMDM 


FRDRQAFSEH 


IWKMLLSVCR 


mxoxoxl 


MSNLLTVHQN 


LPALPVDATS 


DEVRKNLMAM 


FRDRQAFSEH 


IWKMLLSVCR 


mxoxox2 


MSNLLTVHQN 


LPALPVDATS 


DEVRKNLMDM 


FRDRQAFSEH 


IWKMLLSVCR 


mxoxox3 


MSNLLTVHQN 


LPALPIDATS 


DEVRKNLMDM 


FRDRQAFSEH 


IWKMLLSVCR 


mxoxox4 


MSNLLTVHQN 


LPALPVDATS 


DEVRKNLMDM 


FRDRQAFSEH 


IWKMLLSVCR 


mxoxoxS 


MSNLLTVHQN 


LPALPVDATS 


DEVRKNLMAM 


FRDRQAFSEH 


IWKMLLSVCR 


mxoxox6 

1 


MSNLLTVHQN 


LPALPVDATS 


DEVRKNLMDM 


FRDRQAFSEH 


IWKMLLSVCR 








A 




B 


wtCre 


SWAAWCKLNN 


RKWFPAEPED 


VRDYLLYLQA 


RGLAVKIIQQ 


HLGQLNMLHR 


mxpxbxl 


SWAAWCKLNN 


RKWFPAEPED 


VRDYLLYLQA 


RGLAVKIIQQ 


HLGQLNMLHR 


:mxoxox2 


SWAAWCKLNN 


RKWFPAEPED 


VRDYLLYLQA 


RGLAVKTiQQ 


HLGQUSTMLHR 


mxdx6x3 


SWAAWCKLNN 


RKWFPAEPED 


VRDYLLYLQA 


RGLAVKIIQQ 


HLGQLNMLHR 


mxoxox4 


SWAAWCKLNN 


RKWFPAEPED 


VRDYLLCLQA 


RGL^VKIIQQ 


HLGQLNMLHR 


mfoxQx5 


SWAAWCKLNN 


RKWFPAEPED 


VRDYLLYLQA 


RGL^VKTIQQ 


HLGQLNMIHR 


.mxoxox6: 


SWAAWCKLNN 


RKWFPAEPED 


VRDYLLYLQA 


RGLAVKIIQQ 


HLGQU^MLHR 



c 



D 



||il]ix6M: 



RSGLPRPSDS 
RSGLPRPSDS 
RSGLPRPSDS 
RSGLPRPSDS 
RSGLPRPSDS 
QSGLPRPSDS 
RSGLPRPSDS 



NAVSLVMRRI 
NAVSLVMRRI 
NAVSLVMRRI 
NAVSLVMRRI 
NAVSLVMRRI 
NAVSLVMRRI 
NAVSLVMRRI 



RKENVDAGBl 
RKENVDAGER 
RKENVDAGER 
RKBWDAGER 
RKENVDAGER 
RKENVDAGER 
RKENVDAGER 



AKQALAFERT 
AKQALAFERT 
AKQALAFGRT 
AKQALAFHIT 
AKQALAFKRT 
AKQALAFERT 
AKQALAFTOT 



DFDQVRSLME 
DFDQVRSLME 
DFDQVRSLME 
DFDQVRSLME 
DFDQVRSLME 
DFDQVRSLME 
DFDQVRSLME 



E 





NSDRCQDIRN 


LAFLGIAYNT 


LLRlAeARI 


RVKDISRTDG 


GRMLIHIGRT 


:;mxC3ixpXl||: 


NSDHCQDIRN 


LAFLGIAYNT 


LLRIAEIARI 


RVKDISRTNG 


GRMLIHIGRT 


mxoxpx2 


NSDRCQDIRN 


LAFLGIAYNT 


LLEIAEIARI 


RVKDISRTDG 


GRMLIHIGRT 


mxoxpxS 


NSDRCQDIRN 


LAFLGIAYNT 


LLRIAEIARI 


RVKDISRTNG 


GRMLIHISRT 


mxoxbx4 


NSDRCQDIRN 


LAFLGIAYNT 


LLHIAEIARI 


RVKDISRTDG 


GRMLIHIGRT 


mx 0X0x5 


NSDRCQDIRN 


LAFLGIAYNT 


LLRIAEIARI 


RVKDISRTDG 


GRMLIHIGRT 


mxoxQx.6 


NSDRCQDIRN 


LAFLGIAYNT 


LLRIAEIARI 


RVKDISRTDG 


GRMLIHIGRT 


151 














G 


H 


1 


2 
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In Vitro Evolution of the Cre recombinase 



wtCre 

mxoxoxl 

mxoxox2 

mxoxox3 

mxoxox4 

mxoxoxS 

mxoxox6 

201 



KTLVSTAGVE 
iCTLVSTAGVE 
KTLVSTAGVE 
iCDLVSTAGVE 
iCTLVSTAGVE 
KTLVSTAGVE 
KTLVSTAGVE 



KALSLGVTKL 
KALSLGVTKL 
KALSLGVTKL 
KALSLGVTKL 
KALSLGVTKL 
KALSLGVTKQ 
KALSLGVTKL 



VERWISVSGV 

VERWISISGV 

VERWISVSGV 

VEQWISVSGV 

VERWISVSGV 

VERWISVSGV 

VERWISVSGV 



ADDPNNYLFC 
ADDPNNYLFC 
ADDPNNYLFC 
ADDPNNYLFC 
ADDPNNYLFC 
ADDPNNYLFC 
ADDPNNYLFC 



RVRKN GVAAP 
RVRKN GVAAP 
RVRKN GVAAP 
RVRKN GVAAP 
RVRKNG VAAP 
RVRKN GVAAP 
RVRKN GVAAP 



4 





wtCre 


SATSQLSTBA 


LEGIFEATHR 


UYGAKDDSG 


QRYLAWSiHS 


A>:VGAARDMA 






SATSOI STRA 


I GGIFEATHR 


UYGAKDDSG 


QRYL^WSGHS 


AfiVGAARDMA 




mxoxox2 


SATGQL^TRA 


LGGIFEATHR 


UYGAKDDSG 


QRYLAWSQHS 


ARVGAARDMA 




mxoxoxB 


SATSRLSTEA 


LGGIFEATHR 


UYGAKDDSG 


QRYLAWSSHS 


ARVGAARDMA 




mxq.xox4 


SATSQLSTRA 


LEGIFEATHR 


UYGAKDDSG 


QBYQAW^HS 


ARVGAARDMA 




mxoxoxS 


SATSQLSTBA 


LGGIFEATHR 


UYGAKDDSG 


QRYLAW^HS 


ARVGAARDMA 




mx 0X0x6 


SATSQLSIBA 


LGGIFEATHR 


UYGAKDDSG 


QRYQAWSQHS 


ABVGAARDMA 




251 
































J 






K 




,::wtGre.::^-; ;;;; 


RAGVSIPEIM 


QAGGVrmVNI 


VMNYIRNLDS 


ETGAMVRLLE 


DGD* 




;;::mx<i^oxT: 


; RAGVSIPEIM 


QAGGWmVNI 


VMKiTENLDS 


ETGAMVRUJE 


DGI> 




^:imxQxox2 


' RAGVSIPEIM 


QAGGWSNVNI 


VMNYIRNLDS 


ETGAMVRLLE 


DGI> 




mxoxQxS: 


: RAGVSILEM 


QAGG¥/TNVNI 


VMN\1RNLDS 


ETGAMVRLLE 


DGD* 






: RAGVSIPEIM 


QAGG\¥TNVNI 


VMNvlfiNLDS 


ETGAMVRLLE 


DGD- 




mxoxpx5: 


RAGVSIPEIM 


QAGGWSNVNI 


VMNYIRNLDS 


ETGAMVRLLE 


DGD- 


;™ 


ymx6xox6:^ 


. RAGVSIPEIM 


QAGGWSNVNI 


VMNARNLDS 


ETGAMVRLLE 


DGI> 




301 






























L 




M 


N 





G - mutated residue 
^ - catalytic residue 
F - DNA contacting residue 

- a-helix 

- (3 sheet 
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Transition 


Frequency 


Transversion 


Frequency 


A-G 


29 % (9) 


A.-C 


6.5 % (2) 


G-A 


26 % (8) 


C-A 


< 3 % (0) 


C-T 


6.5 % (2) 


A-T 


9.7 % (3) 


T-C 


6.5 % (2) 


T-A - 


9.7 % (3) 




■ ,:.:C-G;: 


< 3 % (0) 


;VV:G.- e?^;;^^;- 


< 3 % (0) 




6.5 % (2) 


■t: :;,\T:-G:;:v.::?;^ 


< 3 % (0) 



31 point mutations could be identified in the six analyzed mutants. The mutagenic 
frequency can therefore be calculated as 0.5%, No frame-shift mutations, or more than one 
point mutation per codon of the ere sequence were found. The table classifies these 
mutations into the different types of transition and transversion events. Given are the 
frequencies and in parenthesis the actual numbers which were found. Interestingly, the A to 
G and vice versa transisiton occurred much more often than all other possible events. On 
the other hand, half of the possible trans versions were not identified at all. 
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Table 13 



Round 


Induction 


% Kan^ 


Nb. Kan** 


% White 


0 


4h 


< 0.01% 




100% 


1 


4h 


0.16% 


6 


84% 


2 


4h 


0.56% 


47 


82% 


3 


2.5 h 


0.2% 


36 


80% 


4 


2.5 h 


4.6% 


102 


75% 



i3 

m 

Q 
■M 

m 

0 

^¥ 
Q 

m 

Q 
u 
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Table 14 



% Kan' 



Mutant 


loxK2^ 


loxP^ 


wt 


<0.01% 


94% 


R3M1 


66% 


98% 


R3M2 


54% 


98% 


R3M3 


65% 


91% 


R3M4 


0.02% 


93% 


R3M5 


15% 


91% 


R3M6 


3% 


94% 



a 
iji 

Q 

S 

m 

Q 
Q 
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CLAIMS 

We claim: 

1 . A method of identifying variant recombinases that mediate recombination 
at variant recombination sites, the method comprising, 

(a) bringing into contact 

a mutant recombinase, 

a first nucleic acid sequence comprising a first reporter gene and first 
and second recombination sites, wherein the first and second recombination 
sites are variant recombination sites, and 

a second nucleic acid sequence comprising a second reporter gene and 
third and fourth recombination sites, wherein the third and fourth 
recombination sites can be recombined by a non-mutant recombinase, 

(b) determining if recombination occurs between the first and second 
recombination sites, and determining if recombination occurs between the third and 
fourth recombination sites, 

wherein recombination between the first and second recombination sites 
indicates that the mutant recombinase is a variant recombinase that mediates 
recombination at variant recombination sites, 

wherein recombination between the third and fourth recombination sites 
indicates that the mutant recombinase retains the ability to mediate recombination at 
non-variant recombination sites. 

2. The method of claim 1 wherein the recombination sites comprise 
recognition sequences and compatibility sequences, 

wherein the recognition sequences of the first and second recombination sites 
differ fi:om the recognition sequences of the third and fourth recombination sites, 

wherein the compatibility sequences of the first and second recombination 
sites are sufficiently similar to allow recombination between the first and second 
recombination sites, and wherein the compatibility sequences of the third and fourth 
recombination sites are sufficiently similar to allow recombination between the third 
and fourth recombination sites, and 
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wherein the compatibiUty sequences of the first and second recombination 
sites differ from the compatibiUty sequences of the third and fourth recombination 
sites such that neither the first nor the second recombination site can be recombined 
with either the third or the fourth recombination site. 

3. The method of claim 1 wherein the first and second recombination sites 
cannot be recombined by non-mutant recombinase to a significant extent. 

4. The method of claim 1 or 2 wherein the first and second recombination 
sites have identical sequences, and wherein the third and fourth recombination sites 
have identical sequences. 

5. The method of claim 1 wherein recombination between the first and 
second recombination sites alters the expression of the first reporter gene, wherein 
recombination between the first and second recombination sites is determined by 
determining if expression of the fu-st reporter gene is altered, and 

wherein recombination between the third and fourth recombination sites 
alters the expression of the second reporter gene, wherein recombination between the 
third and fourth recombination sites is determined by determining if expression of 
the second reporter gene is altered. 

6. The method of claim 5 wherein recombination between the first and 
second recombination sites allows the first reporter gene to be expressed. 

7. The method of claim 6 wherein the first nucleic acid sequence fiirther 
comprises a spacer sequence flanked by the first and second recombination sites, 
wherein the spacer sequence interrupts the first reporter gene such that the first 
reporter gene is not expressed, wherein recombination of the first and second 
recombination sites excises the spacer sequence which allows the first reporter gene 
to be expressed. 

8. The method of claim 6 wherein a portion of the first reporter gene is 
inverted, wherein the inverted portion of the first reporter gene is flanked by the first 
and second recombination sites, wherein recombination of the first and second 
recombination sites inverts the inverted portion of the first reporter gene which 
allows the first reporter gene to be expressed. 
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9. The method of claim 5 wherein recombination between the first and 
second recombination sites prevents expression of the first reporter gene. 

10. The method of claim 9 wherein the furst reporter gene is flanked by the 
first and second recombination sites, wherein recombination of the first and second 
recombination sites excises the first reporter gene which prevents expression of the 
first reporter gene. 

1 1 . The method of claim 9 wherein a portion of the first reporter gene is 
flanked by the first and second recombination sites, wherein recombination of the 
first and second recombination sites inverts the flanked portion of the first reporter 
gene which prevents expression of the first reporter gene. 

12. The method of claim 5 wherein recombination between the third and 
fourth recombination sites allows the second reporter gene to be expressed. 

13. The method of claim 12 wherein the second nucleic acid sequence 
fiirther comprises a spacer sequence flanked by the third and fourth recombination 
sites, wherein the spacer sequence interrupts the second reporter gene such that the 
second reporter gene is not expressed, wherein recombination of the third and fourth 
recombination sites excises the spacer sequence which allows the second reporter 
gene to be expressed. 

14. The method of claim 13 wherein the spacer sequence interrupts the 
second reporter gene such that the second reporter gene is not transcribed. 

15. The method of claim 13 wherein the second reporter gene encodes a 
protein, wherein the spacer sequence interrupts the second reporter gene such that the 
protein encoded by the second reporter gene is not translated. 

16. The method of claim 13 wherein the spacer sequence interrupts the 
second reporter gene such that the second reporter gene produces an inactive 
expression product. 

17. The method of claim 12 wherein a portion of the second reporter gene is 
inverted, wherein the inverted portion of the second reporter gene is flanked by the 
third and fourth recombination sites, wherein recombination of the third and fourth 
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recombination sites inverts the inverted portion of the second reporter gene which 
allows the second reporter gene to be expressed. 

1 8. The method of claim 5 wherein recombination between the third and 
fourth recombination sites prevents expression of the second reporter gene to be 
expressed, 

19. The method of claim 18 wherein the second reporter gene is flanked by 
the third and fourth recombination sites, wherein recombination of the third and 
fourth recombination sites excises the second reporter gene which prevents 
expression of the second reporter gene. 

20. The method of claim 18 wherein a portion of the second reporter gene is 
flanked by the third and fourth recombination sites, wherein recombination of the 
third and fourth recombination sites inverts the flanked portion of the second reporter 
gene which prevents expression of the second reporter gene. 

21. The method of claim 1 wherein the first nucleic acid sequence is a first 
nucleic acid construct and the second nucleic acid sequence is on a second nucleic 
acid construct. 

22. The method of claim 21 wherein the first nucleic acid construct is an 
extrachromosomal vector and the second nucleic acid construct is in the genome of a 
host cell. 

23. The method of claim 1 wherein the first and second nucleic acid 
constructs are on the same nucleic acid construct. 

24. A method for producing site-specific recombination of DNA, 
comprising, 

contacting a variant recombinase identified by the method of claim 1 with 
first and second DNA sequences, 

wherein the first DNA sequence comprises a first recombination site and the 
second DNA sequence comprises a second recombination site, 

wherein the variant recombinase mediates recombination between the first 
and second recombination sites thereby producing the site specific recombination. 
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25. The method of claim 24 wherein the first recombination site, the second 
recombination site, or both, are variant recombination sites. 

26. The method of claim 24, wherein the first and second DNA sequences 
are connected by a pre-selected DNA segment. 

27. The method of claim 26, wherein the first and second recombination sites 
have the same orientation and the site-specific recombination of DNA is a deletion of 
the pre-selected DNA segment. 

28. The method of claim 27, wherein the pre-selected DNA segment is a 
gene for a structural protein, an enzyme, or a regulatory molecule. 

29. The method of claim 27 further comprising contacting the variant 
recombinase with a fourth DNA sequence comprising a third recombination site, 
wherein the second and fourth DNA sequences are connected by a second pre- 
selected DNA segment. 

30. The method of claim 29 wherein the first recombination site is a variant 
recombination site recognized by the variant recombinase and not by wild type 
recombinase, and wherein the second and third recombination sites are 
recombination sites recognized by wild type recombinase and by the variant 
recombinase. 

3 1 . The method of claim 30 fiirther comprising, prior to contacting the 
variant recombinase with the first, second, and third recombination sites, contacting 
the recombination sites with wild type recombinase, thereby producing site specific 
recombination between the second and third recombination sites resulting in a 
deletion of the second pre-selected DNA segment, 

32. The method of claim 29, wherein the second pre-selected DNA segment 
is a gene for a structural protein, an enzyme, or a regulatory molecule. 

33. The method of claim 26, wherein the first and second recombination sites 
have opposite orientations and the site-specific recombination is an inversion of the 
nucleotide sequence of the pre-selected DNA segment, 

34. The method of claim 33, wherein the first and second recombination sites 
are variant recombination sites recognized by the variant recombinase. 
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35. The method of claim 33, wherein the pre-selected DNA segment is a 
gene for a structural protein, an enzyme, or a regulatory molecule. 

36. The method of claim 24, wherein the second and third DNA sequences 
are introduced into two different DNA molecules and the site-specific recombination 
is a reciprocal exchange of DNA segments proximate to the recombination sites. 

37. The method of claim 36, wherein the first and second recombination sites 
are variant recombination sites recognized by the variant recombinase. 

38. The method of claim 24 wherein the second DNA sequence includes a 
label, wherein recombination between the first and second recombination sites 
associates the label with the first DNA sequence. 

39. The method of claim 38 wherein the first DNA sequence is a large 
circular DNA molecule. 

40. The method of claim 24 wherein recombination occurs in a cell. 

41. The method of claim 40 wherein the variant recombinase is contacted 
with the first and second DNA sequences by introducing into the cell a third DNA 
sequence comprising DNA encoding the variant recombinase. 

42. The method of claim 41 , wherein the third DNA sequence fiirther 
comprises a regulatory nucleotide sequence and expression of the variant 
recombinase is produced by activating the regulatory nucleotide sequence. 

43. The method of claim 40, wherein the cell is a eukaryotic cell, a 
mammaUan cell, a yeast cell, a fiingal cell, a prokaryotic cell, a bacterial cell, an 
archae bacterial cell, or a cell in a multicellular organism. 

44. The method of claim 43 wherein the multicellular organism is a plant, an 
animal, or a mammal. 

45. The method of claim 40, wherein the first and second DNA sequences 
are connected by a pre-selected DNA segment, wherein the first and second 
recombination sites have the same orientation and the site-specific recombination of 
DNA is a deletion of the pre-selected DNA segment. 

46. The method of claim 45 wherein the cell is in a multicellular organism. 
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47. The method of claim 45, wherein the pre-selected segment is an 
undesired marker or trait gene. 

48. The method of claim 24 wherein the variant recombinase is contacted 
with the recombination sites in vitro. 

49. The method of claim 48 wherein the method further comprises 
introducing the recombined DNA into a cell. 

50. A method for cloning large DNA fragments, the method comprising 
concatenating DNA fragments to be cloned with vector arms, wherein each 

vector arm comprises a recombination site, wherein the DNA fragments and vector 
arms alternate in the concatemer, 

introducing the concatemer into a cell expressing a variant recombinase 
identified by the method of claim 1, wherein the recombinase mediates 
recombination of the recombination sites thereby generating circles each containing 
a DNA fragment and a vector arm. 

5 1 . A method for cloning large DNA fragments, the method comprising 
ligating a DNA fragment to be cloned to vector arms, wherein each vector 

arm comprises (i) a blunt end, (ii) another end which is compatible with an end of the 
DNA fragment to be cloned, and (iii) a recombination site, wherein concatemers are 
not formed, and 

introducing the ligated DNA fragment and vector arms into a cell expressing 
a variant recombinase identified by the method of claim 1. 

52. A method for cloning large DNA fragments, the method comprising 
ligating a plurality of DNA fragments to be cloned with a plurality of first 

and second vector arms, wherein each first vector arm comprises two ligatable ends, 
wherein each second vector arm comprises a recombination site and one non- 
ligatable end, 

wherein, following ligation, the DNA fragments and first vector arms 
alternate in concatemers, wherein the concatemers are flanked by second vector 
arms. 
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introducing the concatemers into a cell expressing a variant recombinase 
identified by the method of claim 1, wherein the recombinase mediates 
recombination of the recombination sites thereby generating circles containing the 
DNA fragments. 

53. A variant recombinase identified by the method of claim 1 . 

54. A nucleic acid molecule encoding a variant recombinase identified by the 
methodof claim 1. 

55. The nucleic acid molecule of claim 54 wherein the nucleic acid molecule 
is a plasmid. 

56. A cell containing the nucleic acid molecule of claim 54. 

57. The cell of claim 56 wherein the cell is a eukaryotic cell, a mammalian 
cell, a yeast cell, a fimgal cell, a prokaryotic cell, a bacterial cell, an archae bacterial 
cell, or a cell in a multicellular organism. 

58. The cell of claim 57 wherein the multicellular organism is a plant, an 
animal, or a mammal. 

59. A nucleic acid molecule having at least one variant recombination site, 
wherein the variant recombination site is recognized by a variant recombinase 
identified by the method of claim 1 and is not recognized by wild type recombinase. 

60. The nucleic acid molecule of claim 59 wherein the recombination site is 
not recognized by wild type recombinase. 

61. The nucleic acid molecule of claim 59, wherein a first recombination site 
and a second recombination site are connected by a pre-selected DNA segment. 

62. The nucleic acid molecule of claim 59 wherein the nucleic acid molecule 
is a plasmid. 

63. A cell containing the nucleic acid molecule of claim 59. 
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METHOD FOR SELECTING RECOMBINASE VARIANTS 
WITH ALTERED SPECIFICITY 

ABSTRACT OF THE DISCLOSURE 
Disclosed are variants of Cre recombinase that have broadened specificity for 
the site of recombination. Specifically, the disclosed variants mediate recombination 
between sequences other than the loxP sequence and other lox site sequences on 
which wild type Cre recombinase is active. In general, the disclosed Cre variants 
mediate efficient recombination between lox sites that wild type Cre can act on 
(referred to as wild type lox sites), between variant lox sites not efficiently utilized 
by wild type Cre (referred to as variant lox sites), and between a wild type lox site 
and a variant lox site. Also disclosed are methods of recombining nucleic acids 
using the disclosed Cre variants. For example, the disclosed Cre variants can be 
used in any method or technique where Cre recombinase (or other, similar 
recombinases such as FLP) can be used. In addition, the disclosed Cre variants 
allow different alternative recombinations to be performed since the Cre variants 
allow much more efficient recombination between wild type lox sites and variant lox 
sites. Control of such alternative recombination can be used to accomplish more 
sophisticated sequential recombinations to achieve results not possible with wild 
type Cre recombinase. 
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Figures 1 and 2 
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Fig, i : Comparison of the three different lox sites used in this study. loxP is the original substrate for wt Cre, 
catalyzing site-specific recombination. The characteristic two 13 bp inverted repeats and the two ATG start 
codons (located in the non-canonical spacer of 8 bp) are indicated by the horizontal arrows. The strand cleavage 
sites for Cre are indicated by the vertical arrows. This cleavage leads to a 6 bp 5' overhang and a 
3' phospohotyrosine linkage with Y324, Cre's catalytic residue. The positions in the lox sites are assigned as 
proposed by Hoess, et. at. (1982). 

loxKl and loxKl, the two altered lox sites used in this study, differ from lox? mainly in positions 
10/14, and 11/12, respectively (as indicated by bold letters). These alterations are believed to inhibit wt Cre 
activity on the sites. The other differences, the complete 8 bp spacer and three outermost bp (indicated by bold 
letters) were shown not to influence wt Cre. 




Eie^ Illustration of the influence of the direction of lox? sites on Cre recombination. If the sites are orientated 
m the same direction (defined by the two ATGs in the 8 bp non-canonical spacer region) recombination 
catalyzed by Cre will result in excision of the DNA flanked by the lox sites, whereas recombination events on 
sites orientated in the same direction will result in an inversion. 
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Figure 3: Illustration of tlie 
random mutagenesis procedure, 
called DNA shuffling. The 
procedure mimics - what is 
believed to be evolution: 
mutagenic eveius and 
recombination between pools of 
different but highly related 
sequences: 

In vitro, a PGR of the gene of 
interest {ere) is used as the basis 
for the mutagenic event, resulting 
in a pool of varying but related 
Sequences. Random fraginentation 
leads to the indicated pool of 
random size fragments, breaking 
the linkage between the occun-ed 
point mutations, A reassembly 
process by PGR eventually 
allows to recover an enlarged 
variety of different mutants. The 
final step, a selection for the 
desired mutants, permits to 
recover the ones with 
combinations of beneficial 
mutations and eliminates 
deleterious combinations. The 
selected pool will then serve as 
starting material for the next 
round. 



Figures 5 and 6 
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Fig. 5: Represented are agarose gels used 
during the mutagenesis procedure (DNA 
shuffling). Gels were run after DNase 1 digest 
and after the reassembly process by self-priming 
and reassembly PGR. 

(A) The random fragment pools obtained after 
the DNase 1 digest and separated on a 2% GTG 
Sea Kern Agarose gel (FMC, Rockland, ME) 
are shown. 

(B) The same gel is presented: The random 
fragments of 25 bp to 300 bp size have 
migrated and stuck to the described DE81 
paper resin (V/hatman, Maidstone, UK). 

(C) 5^il aliquots of the self-priming PGR mix 
and the reassembly PGR are analyzed by gel 
electrophoresis. During self-priming PGR (left 
lane) a smear of fragments of 300 bp to 2 kb 
has been generated out of the pool of small 
fragments. However, no distinct product is 
achieved. After the reassembly PGR (right 
lane), in which the original ere primers are 
added, one distinct product of approximately 
LI kb size is obtained. This represents a pool 
of different ere mutants, to be used in the 
selection procedure. 




Fig. 6: Shown is pBAD33, chosen for the expression 
of the mutant ere pools during the selection procedure 
(adapted form Guzman, e/. a/., 1995), pBAD33 
contains parts of the arabinose operon, including the 
pBAD promoter and the gene encoding for the 
regulatory protein AraG. Any gene inserted into the 
MGS will be under pBAD control. This permits to 
regulate its expression from low level to nearly 
complete repression by changing 0.2% L-(+)-arabinose 
in the medium to 0.2% D-glucose. In addition, Gm 
can be used for selection for pBAD33 containing cells 
and the pAGYG184 derived origin of replication is 
compatible to ColEi origins. 



Figure 7 




Fig. 7: Construction of the first generation of selection plasmids, pBS568 and pBS569: (0 pBS561 was 
assembled using three fragments: the RSVneo backbone, the 5'-modified neo gene from pBS398, and the MCS. 
The used restriction sites are highlighted. The complete sequence of the MCS, made of oligonucleotides, is 
indicated, too. Note the SD sequence, ATG start, and TGA stop codons, necessary for the described selection 
strategy, {if) In the following steps, the EGFP gene as a spacer was inserted along with the two lox sites 
orientated in the same direction, by using the highlighted restriction sites. Unexpectedly, lox recombination in 
the final selection plasmids did not lead to neo expression as selectable marker for desired Kl* or K2 Cre 
mutants. For further information, see text. 



Figures 8 and 9 




Fig. 8: Construction of the second generation of selection plasmids, pBS583 and pBS584: (/) The original neo 
sequence was restored by inserting the lox^ cassette of pBS568 and pBS569 into the RSVneo backbone by using 
the highlighted restriction sites, {ii) A transcriptional terminator, rniBTiTj, obtained by PGR from pBAD33, was 
inserted 3' of the EGFP spacer as indicated. The derived plasmids were suitable for the desired selection for KT or 
Cre mutants: Excision of the EGFP spacer and the transcriptional terminator by Kl* or K2^ Cre-mediated 
loxK\ or loxKl recombination rendered the E, colt host Kan*^ as anticipated. For more information, see text. 




Fig. 9: Control plasmid pBS6l3: A lox? selection 
plasmid, constructed similarly to pBS583 and pBS584. It 
was used as control plasmid to detennine the 
recombination frequency of Cre mutants on ioxP (as 
described for loxKl and loxKl before). 



Figure 10 





Fig. 10: Constnjction of the screening plasmids pBS601 and pBS602: (/) Deletion of the Pvu II fragment of neo 
of plasmids pBS581/pBS582 (an intermediate of the pBS583/584 construction) restored the possibility to use it 
as selectable marker in a different selection, provided by the bacterial background (see text), (//) The EGFP spacer 
gene was replaced by the lac promoter derived from pUC 19 by PGR, (///) the abf A.st marker gene from pBS48 i , 
encoding an a-L-arabinofuranosidase, was inserted between the two lox sites. Note, that the restriction sites used 
during this construction process are highlighted as before. loxKl or ioxKl recombination will result in excision 
ofabfAsi, allowing a blue/white screen with Z-ara as described in the text. 
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Figure '7 



wt- FRT: GAAGTTCCTATTC TCTAGAAA GTATAGGAACTTC 

Note - alternatively, a full site can also be used 

having 3 repeat elements: 
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tA GTATAGGAACTTC 



FRT-M: 



GAAGTTCaTATTC TCTAGA 



Spacer ikiVercM^ 




tA GTATAtGAACTTC 



Ffaure !8 




mutant FIP genes 






indicator/Selector Bacterbl Strari 



wt FRT: GAAGTTCCTATTC TCTAGAAA GTATAGGAACTTC 

Note - alternatively, a full site can also be used 

having 3 repeat elements : 



GAAGTTCCTATTCCGAAGTTCCTATTC TCTAGAAA GTATAGGAACTTC 



FRT-A2: GAAGTTCCTATTC TCTAGA 



tA GTATAGGAACTTC 



a FRT-M2 



GAAGTTaCTATTC TCTAGA 




tA GTATAGtAACTTC 



Please type a plus sign (+) inside tliis box — > [jt] PTO/SB/01 (12-97) 

Approved for use tiirougii 9/30/00. 0MB 0651-0032 
Patent and Trademarl< Office; U.S. DEPARTMENT OF COMMERCE 
Under tfie Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains 
a valid 0MB control number. _____„„^ 



DECLARATION FOR UTILITY OR 
DESIGN 
PATENT APPLICATION 
(37CFR1.63) 



H Declaration 
Submitted 
witli Initial 
Filing 



D Declaration 
OR Submitted after Initial 
Filing (surcharge 
(37CFR 1.16(e)) 
required) 



Attorney Docket Number 



First Named Inventor 



OMRF 178 



Brian Lee Sauer 



COMPLETE !F KNOWN 



Application Number 



Filing Date 



Group Art Unit 



Examiner Name 



/ 



April 6, 2000 



Not Yet Assigned 



Not Yet Assigned 



As a below named Inventor, I hereby declare that: 

My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural 
names are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 



METHOD FOR SELECTING RECOMBINASE VARIANT WITH 
ALTERED SPECIFICITY 



the specification of which 

' is attached hereto 
OR 

□ was filed on (MM/DDA'YYY) 



(Titfe of the invention) 



as United States Application Number or PCT International 



Application Number 



and was amended on (MM/DDATYY) 



(if applicable). 



I hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as 
amended by any amendment specifically referred to above. 

I acknowledge the duty to disclose information which Is material to patentability as defined in 37 CFR 1.56. 



I hereby claim foreign priority benefits under 35 U.S.C. 119(a)-(d) or 365(b) of any foreign application(s) for patent or inventor's 
certificate, or 365(a) of any PCT international application which designated at least one country other than the United States of 
America, listed below and have also identified below, by checking the box, any foreign application for patent or inventor's certificate, 
or of any PCT international application having a filing date before that of the application on which priority is claimed. 



Prior Foreign Application 
NumberCs) 



Country 



Foreign Filing Date 
(MM/DD/YYYY) 



Priority 
Not Claimed 



Certified Copy Attached? 
YES NO 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



□ Additional foreign application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto: 
I hereby claim the benefit under 35 U.S.C, 119fe) of any United States provisional application(s) listed below. 



Application Nutnber(s) 



60/127,977 



Filing Date (MIWDD/YYYY) 



April 6, 1999 



I I Additional provisional application 
numbers are listed on a 
supplemental priority data sheet 
PTO/SB/02B attached hereto. 



+ 



[Page 1 of 2] 

Burden Hour Statement: This fomi is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the 
individual case. Any comments on the amount of time you are required to complete this form should be sent to the Chief Information 
Officer. Patent and Trademark Office, Washington, DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS 
ADDRESS. SEND TO: Assistant Commissioner for Patents, Washington, DC 20231 . 

OMRF 278 



20487/247 



April 6, 2000 

Method for Selecting Recombinase Variants with Altered Specificity 



Please type a plus sign {+) inside this box 



PTO/SB/01 (12-97) 
Approved for usetlirougli 9/30/00. 0MB 0651-0032 
Patent and Trademaric Office: U.S. DEPARTIVIENT OF COMMERCE 
Under the Paperwori< Reduction Act of 1 995, no persons are required to respond to a collection of Information unless it contains 
a valid OMB control number. 



DECLARATION — Utility or Design Patent Appiication 



I hereby claim the benefit under 35 U.S.C. 120 of any United States application(s), or 365(c) of any PCT international appiication designating the 
United States of America, listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT International application in the manner provided by the first paragraph of 35 U.S.C. 112, 1 acknowledge the duty to disclose 
information which is material to patentability as defined in 37 CFR 1.56 which became available between the filing date of the prior application 
and the national or PCT international filing date of this application. 



U.S. Parent Application or PCT Parent 
Number 



Parent Filing Date 
(MM/DD/YYYY) 



Parent Patent Number 
(if applicable) 



□ Additional U.S. or PCT international application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto. 



As a named Inventor, I hereby appoint the following registered practi tioner(s) to prosecute this app lication and to tra nsact all business in the Paten t 
and Trademark Office connected therewith: □ customer Number 



OR 



B Registered practltioner(s) name/registration number listed below 



Place Customer 
Number Bar Code 



Name 



Registration 
Number 



Name 



Registration 
Number 



Patrea L. Pabst 
Robert A. Hodges 
Kevin W. King 



31,284 
41,074 
42J37 



Additional registered practitione r(s) named on supplemental Registered Practitioner Information sheet PJO^^^^^^J^^j^^^^^jL!]^!^?^ 



Direct ail correspondence to: □ Customer Number 

or Bar Code Label 



OR H Correspondence address below 



Name 



Patrea L, Pabst 



Address 



Amall Golden & Gregory, LLP 



Address 



2800 One Atlantic Center, 1201 West Peachtree Street 



City 



Atlanta 



state 



GA 



ZIP 



30309-3450 



Country 



United States 



Telephone 



(404)873-8794 



Fax 



(404)873-8795 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Name of Sole or First inventor: 



□ A petition has been filed for this unsigned inventor 



Given Name (first and middle fif anvl) 



Family Name or finrnamft 



Brian Lee 



Sauer 



Inventor's 
Signature 



Date 



Residence: City 



Oklahoma City 





OK 




state 


Country 



USA 



Citizenship 



U.S. 



Post Office Address 



601 Robert S Kerr Avenue, Apt. 109 



Post Office Address 



City 



Oklahoma City 



State 



OK 



ZIP 



73102 



Country 



USA 



B Additional inventors are being named on the 1 ^supplemental Additional inventor(s) sheet(s) PTO/SB/02A attached hereto 



[Page 2 of 2] 



OMRF 178 
20487/247 



April 6, 2000 

Method for Selecting Recombinase Variants with Altered Specificity 



PTO/SB/02A (3-97) 
Approved for use through 9/30/98. OMB 0651 -0032 
Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains a 
valid OMB control number. 



Please type a plus sign (+) inside this box -^ j-j- [ 



+ 



DECLARATION 



ADDITIONAL iNVENTOR(S) 
Supplemental Sheet 

Page _L of 1 



Name of Additional Joint Inventor, If any: 



n A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Andreas Walter 



Rufer 



Inventor's 
Signature 



Date 



Residence: City 



Oklahoma City 



state 



OK 



Country 



USA 



German 



Citizenship 



Post Office Address 



125 N.W. 15th Street, Apt. #604 



Post Office Address 



City 



Oklahoma City 





OK 




state 


ZIP 



73103 



Country 



USA 



Name of Additional Joint Inventor, if any: 



n A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Inventor's 
Signature 



Date 



Residence: City 



State 



Country 



Citizenship 



Post Office Address 



Post Office Address 



City 



State 



Name of Additional Joint Inventor, If any: 



ZIP 



Country 



n A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Inventor's 
Signature 



Date 



Residence: City 



State 



I Country 



Citizenship 



Post Office Address 



Post Office Address 



City 



State 



ZIP 



Country 



+ 



Burden Hour Statement: This form is estimated to tal<e 0.4 hours to complete. Time will vary depending upon the needs of the individual case. Any 
comments on the amount of time you are required to complete this form should be sent to the Chief Information Officer, Patent and Trademarf< 
Office, Washington, DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant Commissioner for 
Patents, Washington, DC 20231 . 

OMRF 178 
20487/247 



